WO 03/063688 PCT/US03/02500

PROTEIN MODIFICATION AND MAINTENANCE MOLECULES



TECHNICAL FIELD 

The invention relates to novel nucleic acids, protein modification and maintenance molecules 
encoded by these nucleic acids, and to the use of these nucleic acids and proteins in the diagnosis, 
treatment, and prevention of gastrointestinal, cardiovascular, autoimmune/inflammatory, cell 
proliferative, developmental, epithelial, neurological, and reproductive disorders. The invention also 
relates to the assessment of the effects of exogenous compounds on the expression of nucleic acids 
and protein modification and maintenance molecules. 

BACKGROUND OF THE INVENTION 
The cellular processes regulating modification and maintenance of protein molecules 
coordinate their function, conformation, stabilization, and degradation. Each of these processes is 
mediated by key enzymes or proteins such as kinases, phosphatases, proteases, protease inhibitors, 
isomerases, transferases, and molecular chaperones. 
Kinases 

Kinases catalyze the transfer of high-energy phosphate groups from adenosine triphosphate
(ATP) to target proteins on the hydroxyamino acid residues serine, threonine, or tyrosine. Addition 
of a phosphate group alters the local charge on the acceptor molecule, causing internal 
conformational changes and potentially influencing intermolecular contacts. Reversible protein 
phosphorylation is the ubiquitous strategy used to control many of the intracellular events in 
eukaryotic cells. It is estimated that more than ten percent of proteins active in a typical mammalian
cell are phosphorylated. Extracellular signals including hormones, neurotransmitters, and growth and 
differentiation factors can activate kinases, which can occur as cell surface receptors or as the 
activators of the final effector protein, as well as elsewhere along the signal transduction pathway. 
Kinases are involved in all aspects of a cell's function, from basic metabolic processes, such as 
glycolysis, to cell-cycle regulation, differentiation, and communication with the extracellular 
environment through signal transduction cascades. Inappropriate phosphorylation of proteins in cells 
has been linked to changes in cell cycle progression and cell differentiation. Changes in the cell cycle 
have been linked to induction of apoptosis or cancer. Changes in cell differentiation have been linked 
to diseases and disorders of the reproductive system, immune system, and skeletal muscle. 

There are two classes of protein kinases. One class, protein tyrosine kinases (PTKs), 
phosphorylates tyrosine residues, and the other class, protein serine/threonine kinases (STKs), 
phosphorylates serine and threonine residues. Some PTKs and STKs possess structural 
characteristics of both families and have dual specificity for both tyrosine and serine/threonine 
residues. Almost all kinases contain a conserved 250-300 amino acid catalytic domain containing
specific residues and sequence motifs characteristic of the kinase family. (Reviewed in Hardie, G. 
and S. Hanks (1995) The Protein Kinase Facts Book, Vol I, Academic Press, San Diego, CA, pp. 17- 
20). 

Phosphatases 

Phosphatases hydrolytically remove phosphate groups from proteins. Phosphatases are 
essential in determining the extent of phosphorylation in the cell and, together with kinases, regulate 
key cellular processes such as metabolic enzyme activity, proliferation, cell growth and 
differentiation, cell adhesion, and cell cycle progression. Protein phosphatases are characterized as
either serine/threonine- or tyrosine-specific based on their preferred phospho-amino acid substrate.
Some phosphatases (DSPs, for dual specificity phosphatases) can act on phosphorylated tyrosine,
serine, or threonine residues. The protein serine/threonine phosphatases (PSPs) are important
regulators of many cAMP-mediated hormone responses in cells. Protein tyrosine phosphatases 
(PTPs) play a significant role in cell cycle and cell signaling processes. 
Proteases 

Proteases cleave proteins and peptides at the peptide bond that forms the backbone of the 
protein or peptide chain. Proteolysis is one of the most important and frequent enzymatic reactions
that occurs both within and outside of cells. Proteolysis is responsible for the activation and
maturation of nascent polypeptides, the degradation of misfolded and damaged proteins, and the
controlled turnover of peptides within the cell. Proteases participate in digestion, endocrine function,
tissue remodeling during embryonic development, wound healing, and normal growth. Proteases can 
play a role in regulatory processes by affecting the half life of regulatory proteins. Proteases are 
involved in the etiology or progression of disease states such as inflammation, angiogenesis, tumor 
dispersion and metastasis, cardiovascular disease, neurological disease, and bacterial, parasitic, and 
viral infections. 

Proteases can be categorized on the basis of where they cleave their substrates. 
Exopeptidases, which include aminopeptidases, dipeptidyl peptidases, tripeptidases, 
carboxypeptidases, peptidyl-di-peptidases, dipeptidases, and omega peptidases, cleave residues at the 
termini of their substrates. Endopeptidases, including serine proteases, cysteine proteases, and
metalloproteases, cleave at residues within the peptide. Four principal categories of mammalian 
proteases have been identified based on active site structure, mechanism of action, and overall three- 
dimensional structure. (See Beynon, R.J. and J.S. Bond (1994) Proteolytic Enzymes: A Practical
Approach, Oxford University Press, New York NY, pp. 1-5.)
Serine Proteases 

The serine proteases (SPs) are a large, widespread family of proteolytic enzymes that include
the digestive enzymes trypsin and chymotrypsin, components of the complement and blood-clotting
cascades, and enzymes that control the degradation and turnover of macromolecules within the cell
and in the extracellular matrix. Most of the more than 20 subfamilies can be grouped into six clans, 
each with a common ancestor. These six clans are hypothesized to have descended from at least four 
evolutionarily distinct ancestors. SPs are named for the presence of a serine residue found in the 
active catalytic site of most families. The active site is defined by the catalytic triad, a set of 
conserved aspartate, histidine, and serine residues critical for catalysis. These residues form a
charge relay network that facilitates substrate binding. Other residues outside the active site form an 
oxyanion hole that stabilizes the tetrahedral transition intermediate formed during catalysis. SPs have 
a wide range of substrates and can be subdivided into subfamilies on the basis of their substrate 
specificity. The main subfamilies are named for the residue(s) after which they cleave: trypases
(after arginine or lysine), aspases (after aspartate), chymases (after phenylalanine or leucine), metases
(after methionine), and serases (after serine) (Rawlings, N.D. and A.J. Barrett (1994) Methods Enzymol.
244:19-61). 
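
The subfamily naming rule described above can be illustrated with a minimal sketch. The residue assignments are taken from the preceding paragraph; the function names, the dictionary name, and the example peptide are hypothetical and serve only as illustration. Real proteases also depend on neighboring residues and on substrate structure, which this sketch ignores.

```python
# Illustrative sketch only. The residue rules come from the paragraph above;
# CLEAVAGE_RULES, predict_cleavage_sites, and digest are hypothetical names.
CLEAVAGE_RULES = {
    "trypase": "RK",  # cleaves after arginine or lysine
    "aspase": "D",    # cleaves after aspartate
    "chymase": "FL",  # cleaves after phenylalanine or leucine
    "metase": "M",    # cleaves after methionine
    "serase": "S",    # cleaves after serine
}


def predict_cleavage_sites(sequence, subfamily):
    """Return 1-based positions of residues after which cleavage is predicted."""
    residues = CLEAVAGE_RULES[subfamily]
    # The final residue has no peptide bond after it, so it is skipped.
    return [i + 1 for i, aa in enumerate(sequence[:-1]) if aa in residues]


def digest(sequence, subfamily):
    """Split a one-letter amino acid sequence at every predicted cleavage site."""
    fragments, start = [], 0
    for site in predict_cleavage_sites(sequence, subfamily):
        fragments.append(sequence[start:site])
        start = site
    fragments.append(sequence[start:])
    return fragments


if __name__ == "__main__":
    peptide = "MKTAYRDLFS"  # hypothetical example peptide
    print(digest(peptide, "trypase"))
```

Under these simplified rules, the example peptide is split after its lysine and its arginine when the "trypase" rule is applied, and it is left intact by the "serase" rule because its only serine is the C-terminal residue.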

Most mammalian serine proteases are synthesized as zymogens, inactive precursors that are 
activated by proteolysis. For example, trypsinogen is converted to its active form, trypsin, by 
enteropeptidase. Enteropeptidase is an intestinal protease that removes an N-terminal fragment from 
trypsinogen. The remaining active fragment is trypsin, which in turn activates the precursors of the 
other pancreatic enzymes. Likewise, proteolysis of prothrombin, the precursor of thrombin, generates 
three separate polypeptide fragments. The N-terminal fragment is released while the other two
fragments, which comprise active thrombin, remain associated through disulfide bonds. 

The two largest SP subfamilies are the chymotrypsin (S1) and subtilisin (S8) families. Some
members of the chymotrypsin family contain two structural domains unique to this family. Kringle
domains are triple-looped, disulfide cross-linked domains found in varying copy number. Kringle
domains are thought to play a role in binding mediators such as membranes, other proteins or
phospholipids, and in the regulation of proteolytic activity (PROSITE PDOC00020). Apple domains
are 90 amino-acid repeated domains, each containing six conserved cysteines. Three disulfide bonds
link the first and sixth, second and fifth, and third and fourth cysteines (PROSITE PDOC00376).
Apple domains are involved in protein-protein interactions. S1 family members include trypsin,
chymotrypsin, coagulation factors IX-XII, complement factors B, C, and D, granzymes, kallikrein,
and tissue- and urokinase-plasminogen activators. The subtilisin family has members found in the
eubacteria, archaebacteria, eukaryotes, and viruses. Subtilisins include the proprotein-processing
endopeptidases kexin and furin and the pituitary prohormone convertases PC1, PC2, PC3, PC6, and
PACE4 (Rawlings and Barrett, supra). 

SPs have functions in many normal processes and some have been implicated in the etiology 
or treatment of disease. Enterokinase, the initiator of intestinal digestion, is found in the intestinal
brush border, where it cleaves the acidic propeptide from trypsinogen to yield active trypsin
(Kitamoto, Y. et al. (1994) Proc. Natl. Acad. Sci. USA 91:7588-7592). Prolylcarboxypeptidase, a
lysosomal serine peptidase that cleaves peptides such as angiotensin II and III and [des-Arg9]
bradykinin, shares sequence homology with members of both the serine carboxypeptidase and
prolylendopeptidase families (Tan, F. et al. (1993) J. Biol. Chem. 268:16631-16638). The protease
neuropsin may influence synapse formation and neuronal connectivity in the hippocampus in
response to neural signaling (Chen, Z.-L. et al. (1995) J. Neurosci. 15:5088-5097). Tissue 
plasminogen activator is useful for acute management of stroke (Zivin, J. A. (1999) Neurology 53:14- 
19) and myocardial infarction (Ross, A.M. (1999) Clin. Cardiol. 22:165-171). Some receptors (PAR, 
for proteinase-activated receptor), highly expressed throughout the digestive tract, are activated by 
proteolytic cleavage of an extracellular domain. The major agonists for PARs, thrombin, trypsin, and 
mast cell tryptase, are released in allergy and inflammatory conditions. Control of PAR activation by 
proteases has been suggested as a promising therapeutic target (Vergnolle, N. (2000) Aliment. 
Pharmacol. Ther. 14:257-266; Rice, K.D. et al. (1998) Curr. Pharm. Des. 4:381-396). Prostate-
specific antigen (PSA) is a kallikrein-like serine protease synthesized and secreted exclusively by
epithelial cells in the prostate gland. Serum PSA is elevated in prostate cancer and is the most
sensitive physiological marker for monitoring cancer progression and response to therapy. PSA can
also identify the prostate as the origin of a metastatic tumor (Brawer, M.K. and P.H. Lange (1989)
Urology 33:11-16).

The signal peptidase is a specialized class of SP found in all prokaryotic and eukaryotic cell 
types that serves in the processing of signal peptides from certain proteins. Signal peptides are 
amino-terminal domains of a protein which direct the protein from its ribosomal assembly site to a 
particular cellular or extracellular location. Once the protein has been exported, removal of the signal 
sequence by a signal peptidase and posttranslational processing, e.g., glycosylation or 
phosphorylation, activate the protein. Signal peptidases exist as multi-subunit complexes in both 
yeast and mammals. The canine signal peptidase complex is composed of five subunits, all 
associated with the microsomal membrane and containing hydrophobic regions that span the 
membrane one or more times (Shelness, G.S. and G. Blobel (1990) J. Biol. Chem. 265:9512-9519).
Some of these subunits serve to fix the complex in its proper position on the membrane while others 
contain the actual catalytic activity. 

Thrombin is a serine protease with an essential role in the process of blood coagulation. 
Prothrombin, synthesized in the liver, is converted to active thrombin by Factor Xa. Activated 
thrombin then cleaves soluble fibrinogen to polymer-forming fibrin, a primary component of blood 
clots. In addition, thrombin activates Factor XIIIa, which plays a role in cross-linking fibrin.
Thrombin also stimulates platelet aggregation through proteolytic processing of a 41-residue
amino-terminal peptide from protease-activated receptor 1 (PAR-1), formerly known as the thrombin
receptor. The cleavage of the amino-terminal peptide exposes a new amino terminus and may also be
associated with PAR-1 internalization (Stubbs, M.T. and W. Bode (1994) Curr. Opin. Struct. Biol.
4:823-832; and Ofosu, F.A. et al. (1998) Biochem. J. 336:283-285). In addition to stimulating platelet
activation through cleavage of the PAR-1 receptor, thrombin also induces platelet aggregation 
following cleavage of glycoprotein V, also on the surface of platelets. Glycoprotein V appears to be 
the major thrombin substrate on intact platelets. Platelets deficient for glycoprotein V are 
hypersensitive to thrombin, which is still required to cleave PAR-1. While platelet aggregation is 
required for normal hemostasis in mammals, excessive platelet aggregation can result in arterial 
thrombosis, atherosclerotic arteries, acute myocardial infarction, and stroke (Ramakrishnan, V. et al. 
(1999) Proc. Natl. Acad. Sci. U.S.A. 96:13336-13341 and references within). 

Proteases in another family have a serine in their active site and are dependent on the 
hydrolysis of ATP for their activity. These proteases contain proteolytic core domains and regulatory 
ATPase domains which can be identified by the presence of the P-loop, an ATP/GTP-binding motif 
(PROSITE PDOC00803). Members of this family include the eukaryotic mitochondrial matrix 
proteases, Clp protease and the proteasome. Clp protease was originally found in plant chloroplasts 
but is believed to be widespread in both prokaryotic and eukaryotic cells. The gene for early-onset 
torsion dystonia encodes a protein related to Clp protease (Ozelius, L.J. et al. (1998) Adv. Neurol. 
78:93-105). 

The proteasome is an intracellular protease complex found in some bacteria and in all 
eukaryotic cells, and plays an important role in cellular physiology. The proteasome is a large (~2000
kDa) multisubunit complex composed of a central catalytic core containing a variety of proteases 
arranged in four seven-membered rings with the active sites facing inwards into the central cavity, 
and terminal ATPase subunits covering the outer port of the cavity and regulating substrate entry (for 
review, see Schmidt, M. et al. (1999) Curr. Opin. Chem. Biol. 3:584-591). Proteasomes are 
associated with the ubiquitin conjugation system (UCS), a major pathway for the degradation of 
cellular proteins of all types, including proteins that function to activate or repress cellular processes 
such as transcription and cell cycle progression (Ciechanover, A. (1994) Cell 79:13-21). In the UCS 
pathway, proteins targeted for degradation are conjugated to ubiquitin, a small heat stable protein. 
The ubiquitinated protein is then recognized and degraded by the proteasome. The resultant 
ubiquitin-peptide complex is hydrolyzed by a ubiquitin carboxyl terminal hydrolase, and free
ubiquitin is released for reutilization by the UCS. Ubiquitin-proteasome systems are implicated in the 
degradation of mitotic cyclic kinases, oncoproteins, tumor suppressor genes (p53), cell surface 
receptors associated with signal transduction, transcriptional regulators, and mutated or damaged 
proteins (Ciechanover, supra). This pathway has been implicated in a number of diseases, including
cystic fibrosis, Angelman's syndrome, and Liddle syndrome (reviewed in Schwartz, A.L. and A.
Ciechanover (1999) Annu. Rev. Med. 50:57-74). A murine proto-oncogene, Unp, encodes a nuclear 
ubiquitin protease whose overexpression leads to oncogenic transformation of NIH3T3 cells. The 
human homolog of this gene is consistently elevated in small cell tumors and adenocarcinomas of the 
lung (Gray, D.A. (1995) Oncogene 10:2179-2183). Ubiquitin carboxyl terminal hydrolase is involved 
in the differentiation of a lymphoblastic leukemia cell line to a non-dividing mature state (Maki, A. et
al. (1996) Differentiation 60:59-66). In neurons, ubiquitin carboxyl terminal hydrolase (PGP 9.5) 
expression is strong in the abnormal structures that occur in human neurodegenerative diseases 
(Lowe, J. et al. (1990) J. Pathol. 161:153-160). 
Cysteine Proteases 

Cysteine proteases (CPs) are involved in diverse cellular processes ranging from the 
processing of precursor proteins to intracellular degradation. Nearly half of the CPs known are 
present only in viruses. CPs have a cysteine as the major catalytic residue at the active site where 
catalysis proceeds via a thioester intermediate and is facilitated by nearby histidine and asparagine
residues. A glutamine residue is also important, as it helps to form an oxyanion hole. Two important
CP families include the papain-like enzymes (C1) and the calpains (C2). Papain-like family members
are generally lysosomal or secreted and therefore are synthesized with signal peptides as well as 
propeptides. Most members bear a conserved motif in the propeptide that may have structural 
significance (Karrer, K.M. et al. (1993) Proc. Natl. Acad. Sci. USA 90:3063-3067). Three- 
dimensional structures of papain family members show a bilobed molecule with the catalytic site 
located between the two lobes. Papains include cathepsins B, C, H, L, and S, certain plant allergens 
and dipeptidyl peptidase (for a review, see Rawlings, N.D. and A.J. Barrett (1994) Methods Enzymol. 
244:461-486). 

Some CPs are expressed ubiquitously, while others are produced only by cells of the immune 
system. Of particular note, CPs are produced by monocytes, macrophages and other cells which 
migrate to sites of inflammation and secrete molecules involved in tissue repair. Overabundance of 
these repair molecules plays a role in certain disorders. In autoimmune diseases such as rheumatoid 
arthritis, secretion of the cysteine peptidase cathepsin C degrades collagen, laminin, elastin and other 
structural proteins found in the extracellular matrix of bones. Bone weakened by such degradation is 
also more susceptible to tumor invasion and metastasis. Cathepsin L expression may also contribute 
to the influx of mononuclear cells which exacerbates the destruction of the rheumatoid synovium 
(Keyszer, G.M. (1995) Arthritis Rheum. 38:976-984). 

Calpains are calcium-dependent cytosolic endopeptidases which contain both an N-terminal 
catalytic domain and a C-terminal calcium-binding domain. Calpain is expressed as a proenzyme 
heterodimer consisting of a catalytic subunit unique to each isoform and a regulatory subunit common
to different isoforms. Each subunit bears a calcium-binding EF-hand domain. The regulatory subunit 
also contains a hydrophobic glycine-rich domain that allows the enzyme to associate with cell 
membranes. Calpains are activated by increased intracellular calcium concentration, which induces a 
change in conformation and limited autolysis. The resultant active molecule requires a lower calcium 
concentration for its activity (Chan, S.L. and M.P. Mattson (1999) J. Neurosci. Res. 58:167-190).
Calpain expression is predominantly neuronal, although it is present in other tissues. Several chronic 
neurodegenerative disorders, including ALS, Parkinson's disease and Alzheimer's disease are 
associated with increased calpain expression (Chan and Mattson, supra). Calpain-mediated 
breakdown of the cytoskeleton has been proposed to contribute to brain damage resulting from head 
injury (McCracken, E. et al. (1999) J. Neurotrauma 16:749-761). Calpain-3 is predominantly
expressed in skeletal muscle, and is responsible for limb-girdle muscular dystrophy type 2A (Minami, 
N. et al. (1999) J. Neurol. Sci. 171:31-37). 

Another family of thiol proteases is the caspases, which are involved in the initiation and 
execution phases of apoptosis. A pro-apoptotic signal can activate initiator caspases that trigger a 
proteolytic caspase cascade, leading to the hydrolysis of target proteins and the classic apoptotic 
death of the cell. Two active site residues, a cysteine and a histidine, have been implicated in the 
catalytic mechanism. Caspases are among the most specific endopeptidases, cleaving after aspartate 
residues. Caspases are synthesized as inactive zymogens consisting of one large (p20) and one small 
(p10) subunit separated by a small spacer region, and a variable N-terminal prodomain. This
prodomain interacts with cofactors that can positively or negatively affect apoptosis. An activating
signal causes autoproteolytic cleavage of a specific aspartate residue (D297 in the caspase-1
numbering convention) and removal of the spacer and prodomain, leaving a p10/p20 heterodimer.
Two of these heterodimers interact via their small subunits to form the catalytically active tetramer.
The long prodomains of some caspase family members have been shown to promote dimerization and 
auto-processing of procaspases. Some caspases contain a "death effector domain" in their prodomain 
by which they can be recruited into self-activating complexes with other caspases and FADD protein 
associated death receptors or the TNF receptor complex. In addition, two dimers from different 
caspase family members can associate, changing the substrate specificity of the resultant tetramer. 
Endogenous caspase inhibitors (inhibitor of apoptosis proteins, or IAPs) also exist. All these
interactions have clear effects on the control of apoptosis (reviewed in Chan and Mattson, supra;
Salvesen, G.S. and V.M. Dixit (1999) Proc. Natl. Acad. Sci. USA 96:10964-10967).

Caspases have been implicated in a number of diseases. Mice lacking some caspases have 
severe nervous system defects due to failed apoptosis in the neuroepithelium and suffer early
lethality. Others show severe defects in the inflammatory response, as caspases are responsible for
processing IL-1β and possibly other inflammatory cytokines (Chan and Mattson, supra). Cowpox
virus and baculoviruses target caspases to avoid the death of their host cell and promote successful 
infection. In addition, increases in inappropriate apoptosis have been reported in AIDS, 
neurodegenerative diseases and ischemic injury, while a decrease in cell death is associated with 
cancer (Salvesen and Dixit, supra; Thompson, C.B. (1995) Science 267:1456-1462).
Aspartyl Proteases

Aspartyl proteases (APs) include the lysosomal proteases cathepsins D and E, as well as 
chymosin, renin, and the gastric pepsins. Most retroviruses encode an AP, usually as part of the pol 
polyprotein. APs, also called acid proteases, are monomeric enzymes consisting of two domains,
each domain containing one half of the active site with its own catalytic aspartic acid residue. APs 
are most active in the range of pH 2-3, at which one of the aspartate residues is ionized and the other 
neutral. The pepsin family of APs contains many secreted enzymes, and all are likely to be 
synthesized with signal peptides and propeptides. Most family members have three disulfide loops, 
the first ~5 residue loop following the first aspartate, the second 5-6 residue loop preceding the
second aspartate, and the third and largest loop occurring toward the C terminus. Retropepsins, on 
the other hand, are analogous to a single domain of pepsin, and become active as homodimers with 
each retropepsin monomer contributing one half of the active site. Retropepsins are required for 
processing the viral polyproteins. 

APs have roles in various tissues, and some have been associated with disease. Renin 
mediates the first step in processing the hormone angiotensin, which is responsible for regulating 
electrolyte balance and blood pressure (reviewed in Crews, D.E. and S.R. Williams (1999) Hum. 
Biol. 71:475-503). Abnormal regulation and expression of cathepsins are evident in various 
inflammatory disease states. Expression of cathepsin D is elevated in synovial tissues from patients 
with rheumatoid arthritis and osteoarthritis. The increased expression and differential regulation of 
the cathepsins are linked to the metastatic potential of a variety of cancers (Chambers, A.F. et al. 
(1993) Crit. Rev. Oncol. 4:95-114). 
Metalloproteases 

Metalloproteases require a metal ion for activity, usually manganese or zinc. Examples of 
manganese metalloenzymes include aminopeptidase P and human proline dipeptidase (PEPD).
Aminopeptidase P can degrade bradykinin, a nonapeptide activated in a variety of inflammatory 
responses. Aminopeptidase P has been implicated in coronary ischemia/reperfusion injury. 
Administration of aminopeptidase P inhibitors has been shown to have a cardioprotective effect in 
rats (Ersahin, C. et al. (1999) J. Cardiovasc. Pharmacol. 34:604-611).

Most zinc-dependent metalloproteases share a common sequence in the zinc-binding domain. 
The active site is made up of two histidines which act as zinc ligands and a catalytic glutamic acid
C-terminal to the first histidine. Proteins containing this signature sequence are known as the
metzincins and include aminopeptidase N, angiotensin-converting enzyme, neurolysin, the matrix
metalloproteases and the adamalysins (ADAMs). An alternate sequence is found in the zinc
carboxypeptidases, in which all three conserved residues - two histidines and a glutamic acid - are 
involved in zinc binding. 
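
As an illustration of how such a zinc-binding signature might be located in a sequence, the following minimal sketch scans for the pattern His-Glu-x-x-His. That spacing is an assumption: it is the commonly cited consensus for zinc-dependent metalloproteases but is not spelled out in the text above. The function name, the pattern name, and the example sequences are hypothetical.

```python
import re

# Assumption (not stated in the text above): the two zinc-liganding histidines
# and the catalytic glutamate are arranged as H-E-x-x-H, with x any residue.
# The lookahead lets overlapping occurrences be reported as well.
ZINC_MOTIF = re.compile(r"(?=(HE..H))")


def find_zinc_binding_motifs(sequence):
    """Return (1-based start position, matched text) for each HExxH occurrence."""
    return [(m.start() + 1, m.group(1)) for m in ZINC_MOTIF.finditer(sequence.upper())]


if __name__ == "__main__":
    # Hypothetical fragment used only to exercise the function.
    print(find_zinc_binding_motifs("AAVAAHEFGHSLGL"))
```

A match indicates only that the short pattern is present. It does not establish that a protein is a functional metalloprotease, since such a short pattern can occur by chance in unrelated sequences.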

A number of the neutral metalloendopeptidases, including angiotensin converting enzyme and 
the aminopeptidases, are involved in the metabolism of peptide hormones. High aminopeptidase B 
activity, for example, is found in the adrenal glands and neurohypophyses of hypertensive rats (Prieto, 
I. et al. (1998) Horm. Metab. Res. 30:246-248). Oligopeptidase M/neurolysin can hydrolyze 
bradykinin as well as neurotensin (Serizawa, A. et al. (1995) J. Biol. Chem. 270:2092-2098).
Neurotensin is a vasoactive peptide that can act as a neurotransmitter in the brain, where it has been 
implicated in limiting food intake (Tritos, N.A. et al. (1999) Neuropeptides 33:339-349). 

The matrix metalloproteases (MMPs) are a family of at least 23 enzymes that can degrade 
components of the extracellular matrix (ECM). They are Zn2+ endopeptidases with an N-terminal
catalytic domain. Nearly all members of the family have a hinge peptide and a C-terminal domain
which can bind to substrate molecules in the ECM or to inhibitors produced by the tissue (TIMPs, for
tissue inhibitor of metalloprotease; Campbell, I.L. and A. Pagenstecher (1999) Trends Neurosci.
22:285-287). The presence of fibronectin-like repeats, transmembrane domains, or C-terminal
hemopexinase-like domains can be used to separate MMPs into collagenase, gelatinase, stromelysin
and membrane-type MMP subfamilies. In the inactive form, the Zn2+ ion in the active site interacts
with a cysteine in the pro-sequence. Activating factors disrupt the Zn2+-cysteine interaction, or
"cysteine switch," exposing the active site. This partially activates the enzyme, which then cleaves 
off its propeptide and becomes fully active. MMPs are often activated by the serine proteases 
plasmin and furin. MMPs are often regulated by stoichiometric, noncovalent interactions with 
inhibitors; the balance of protease to inhibitor, then, is very important in tissue homeostasis (reviewed 
in Yong, V.W. et al. (1998) Trends Neurosci. 21:75-80). 

MMPs are implicated in a number of diseases including osteoarthritis (Mitchell, P. et al. 
(1996) J. Clin. Invest. 97:761-768), atherosclerotic plaque rupture (Sukhova, G.K. et al. (1999)
Circulation 99:2503-2509), aortic aneurysm (Schneiderman, J. et al. (1998) Am. J. Path. 152:703-
710), non-healing wounds (Saarialho-Kere, U.K. et al. (1994) J. Clin. Invest. 94:79-88), bone
resorption (Blavier, L. and J.M. Delaisse (1995) J. Cell Sci. 108:3649-3659), age-related macular
degeneration (Steen, B. et al. (1998) Invest. Ophthalmol. Vis. Sci. 39:2194-2200), emphysema
(Finlay, G.A. et al. (1997) Thorax 52:502-506), myocardial infarction (Rohde, L.E. et al. (1999)
Circulation 99:3063-3070) and dilated cardiomyopathy (Thomas, C.V. et al. (1998) Circulation
97:1708-1715). MMP inhibitors prevent metastasis of mammary carcinoma and experimental tumors 
in rat, and Lewis lung carcinoma, hemangioma, and human ovarian carcinoma xenografts in mice
(Eccles, S.A. et al. (1996) Cancer Res. 56:2815-2822; Anderson et al. (1996) Cancer Res. 56:715-
718; Volpert, O.V. et al. (1996) J. Clin. Invest. 98:671-679; Taraboletti, G. et al. (1995) J. Natl.
Cancer Inst. 87:293-298; Davies, B. et al. (1993) Cancer Res. 53:2087-2091). MMPs may be active
in Alzheimer's disease. A number of MMPs are implicated in multiple sclerosis, and administration
of MMP inhibitors can relieve some of its symptoms (reviewed in Yong et al., supra).

Another family of metalloproteases is the ADAMs, for A Disintegrin and Metalloprotease 
Domain, which they share with their close relatives the adamalysins, snake venom metalloproteases 
(SVMPs). ADAMs combine features of both cell surface adhesion molecules and proteases, 
containing a prodomain, a protease domain, a disintegrin domain, a cysteine rich domain, an
epidermal growth factor repeat, a transmembrane domain, and a cytoplasmic tail. The first three
domains listed above are also found in the SVMPs. The ADAMs possess four potential functions:
proteolysis, adhesion, signaling and fusion. The ADAMs share the metzincin zinc binding sequence 
and are inhibited by some MMP antagonists such as TIMP-1. 

ADAMs are implicated in such processes as sperm-egg binding and fusion, myoblast fusion, 
and protein-ectodomain processing or shedding of cytokines, cytokine receptors, adhesion proteins 
and other extracellular protein domains (Schlondorff, J. and C.P. Blobel (1999) J. Cell. Sci.
112:3603-3617). The Kuzbanian protein cleaves a substrate in the NOTCH pathway (possibly
NOTCH itself), activating the program for lateral inhibition in Drosophila neural development. Two 
ADAMs, TACE (ADAM 17) and ADAM 10, are proposed to have analogous roles in the processing 
of amyloid precursor protein in the brain (Schlondorff and Blobel, supra). TACE has also been 
identified as the TNF activating enzyme (Black, R.A. et al. (1997) Nature 385:729-733). TNF is a
pleiotropic cytokine that is important in mobilizing host defenses in response to infection or trauma,
but can cause severe damage in excess and is often overproduced in autoimmune disease. TACE
cleaves membrane-bound pro-TNF to release a soluble form. Other ADAMs may be involved in a 
similar type of processing of other membrane-bound molecules. 

Protems of the ADAMTS sub-family have all of the features of ADAM family 
metalloproteases and contain an additional thrombospondm domain (TS). The prototypic ADAMTS 
was identified in mouse, and found to be expressed in heart and kidney and upregulated by 
promflammatory stimuli (Kuno, K. et al. (1997) J. Biol. Chem. 272:556-562). To date eleven 
members are recognized by the Hmnan Genome Organization (HUGO; 

http://www.gene.ucl.ac.Uk/users/hester/adamts.html#Approved). Members of this family have the 
ability to degrade aggrecan, a high molecular weight proteoglycan which provides cartilage with 
important mechanical properties including compressibility, and which is lost during the development
of arthritis. Enzymes which degrade aggrecan are thus considered attractive targets to prevent and
slow the degradation of articular cartilage (See, e.g., Tortorella, M.D. (1999) Science 284:1664-1666;
Abbaszade, I. (1999) J. Biol. Chem. 274:23443-23450). Other members are reported to have
antiangiogenic potential (Kuno et al., supra) and/or procollagen processing (Colige, A. et al. (1997)
Proc. Natl. Acad. Sci. USA 94:2374-2379).
Protease inhibitors

Protease inhibitors and other regulators of protease activity control the activity and effects of 
proteases. Protease inhibitors have been shown to control pathogenesis in animal models of
proteolytic disorders (Murphy, G. (1991) Agents Actions Suppl. 35:69-76). Low levels of the 
cystatins, low molecular weight inhibitors of the cysteine proteases, correlate with malignant 
progression of tumors (Calkins, C. et al. (1995) Biol. Biochem. Hoppe Seyler 376:71-80). The 
cystatin superfamily of protease inhibitors is characterized by a particular pattern of linearly arranged
and tandemly repeated disulfide loops (Kellermann, J. et al. (1989) J. Biol. Chem. 264:14121-14128). 
Serpins are inhibitors of mammalian plasma serine proteases. Many serpins serve to regulate the 
blood clotting cascade and/or the complement cascade in mammals. Sp32 is a positive regulator of 
the mammalian acrosomal protease, acrosin, that binds the proenzyme, proacrosin, and thereby aids
in packaging the enzyme into the acrosomal matrix (Baba, T. et al. (1994) J. Biol. Chem.
269:10133-10140). The Kunitz family of serine protease inhibitors is characterized by one or more "Kunitz
domains" containing a series of cysteine residues that are regularly spaced over approximately 50 
amino acid residues and form three intrachain disulfide bonds. Members of this family include 
aprotinin, tissue factor pathway inhibitor (TFPI-1 and TFPI-2), inter-α-trypsin inhibitor, and bikunin
(Marlor, C.W. et al. (1997) J. Biol. Chem. 272:12202-12208). Members of this family are potent 
inhibitors (in the nanomolar range) against serine proteases such as kallikrein and plasmin. Aprotinin 
has clinical utility in reduction of perioperative blood loss. 

A major portion of all proteins synthesized in eukaryotic cells is synthesized on the
cytosolic surface of the endoplasmic reticulum (ER). Before these immature proteins are distributed 
to other organelles in the cell or are secreted, they must be transported into the interior lumen of the 
ER where post-translational modifications are performed. These modifications include protein
folding, the formation of disulfide bonds, and N-linked glycosylation.
Protein Isomerases 

Protein folding in the ER is aided by two principal types of protein isomerases, protein 
disulfide isomerase (PDI), and peptidyl-prolyl isomerase (PPI). PDI catalyzes the oxidation of free 
sulfhydryl groups in cysteine residues to form intramolecular disulfide bonds in proteins. PPI, an 
enzyme that catalyzes the isomerization of certain proline imidic bonds in oligopeptides and proteins, 
is considered to govern one of the rate limiting steps in the folding of many proteins to their final 
functional conformation. The cyclophilins represent a major class of PPI that was originally
identified as the major receptor for the immunosuppressive drug cyclosporin A (Handschumacher,
R.E. et al. (1984) Science 226:544-547).
Protein Glycosylation 

The glycosylation of most soluble secreted and membrane-bound proteins by 
oligosaccharides linked to asparagine residues in proteins is also performed in the ER. This reaction 
is catalyzed by a membrane-bound enzyme, oligosaccharyl transferase. Although the exact purpose 
of this "N-linked" glycosylation is unknown, the presence of oligosaccharides tends to make a 
glycoprotein resistant to protease digestion. In addition, oligosaccharides attached to cell-surface 
proteins called selectins are known to function in cell-cell adhesion processes (Alberts, B. et al. 
(1994) Molecular Biology of the Cell, Garland Publishing Co., New York, NY, p. 608). "O-linked"
glycosylation of proteins also occurs in the ER by the addition of N-acetylgalactosamine to the
hydroxyl group of a serine or threonine residue followed by the sequential addition of other sugar 
residues to the first. This process is catalyzed by a series of glycosyltransferases, each specific for a 
particular donor sugar nucleotide and acceptor molecule (Lodish, H. et al. (1995) Molecular Cell 
Biology, W.H. Freeman and Co., New York, NY, pp. 700-708). In many cases, both N- and O-linked
oligosaccharides appear to be required for the secretion of proteins or the movement of plasma 
membrane glycoproteins to the cell surface. 

An additional glycosylation mechanism operates in the ER specifically to target lysosomal 
enzymes to lysosomes and prevent their secretion. Lysosomal enzymes in the ER receive an N-linked 
oligosaccharide, like plasma membrane and secreted proteins, but are then phosphorylated on one or 
two mannose residues. The phosphorylation of mannose residues occurs in two steps, the first step 
being the addition of an N-acetylglucosamine phosphate residue by N-acetylglucosamine 
phosphotransferase, and the second the removal of the N-acetylglucosamine group by 
phosphodiesterase. The phosphorylated mannose residue then targets the lysosomal enzyme to a 
mannose 6-phosphate receptor which transports it to a lysosome vesicle (Lodish et al., supra, pp. 708-711).

Chaperones 

Molecular chaperones are proteins that aid in the proper folding of immature proteins and 
refolding of improperly folded ones, the assembly of protein subunits, and in the transport of 
unfolded proteins across membranes. Chaperones are also called heat-shock proteins (hsp) because
of their tendency to be expressed in dramatically increased amounts following brief exposure of cells
to elevated temperatures. This latter property most likely reflects their need in the refolding of
proteins that have become denatured by the high temperatures. Chaperones may be divided into
several classes according to their location, function, and molecular weight, and include hsp60, TCP1,
hsp70, hsp40 (also called DnaJ), and hsp90. For example, hsp90 binds to steroid hormone receptors,
represses transcription in the absence of the ligand, and provides proper folding of the ligand-binding
domain of the receptor in the presence of the hormone (Burston, S.G. and A.R. Clarke (1995) Essays
Biochem. 29:125-136). Hsp60 and hsp70 chaperones aid in the transport and folding of newly 
synthesized proteins. Hsp70 acts early in protein folding, binding a newly synthesized protein before 
it leaves the ribosome and transporting the protein to the mitochondria or ER before releasing the 
folded protein. Hsp60, along with hsp10, binds misfolded proteins and gives them the opportunity to
refold correctly. All chaperones share an affinity for hydrophobic patches on incompletely folded 
proteins and the ability to hydrolyze ATP. The energy of ATP hydrolysis is used to release the hsp- 
bound protein in its properly folded state (Alberts et al., supra, pp. 214, 571-572). 
Expression profiling 

Microarrays are analytical tools used in bioanalysis. A microarray has a plurality of
molecules spatially distributed over, and stably associated with, the surface of a solid support. 
Microarrays of polypeptides, polynucleotides, and/or antibodies have been developed and find use in 
a variety of applications, such as gene sequencing, monitoring gene expression, gene mapping, 
bacterial identification, drug discovery, and combinatorial chemistry. 

One area in particular in which microarrays find use is in gene expression analysis. Array 
technology can provide a simple way to explore the expression of a single polymorphic gene or the 
expression profile of a large number of related or unrelated genes. When the expression of a single
gene is examined, arrays are employed to detect the expression of a specific gene or its variants. 
When an expression profile is examined, arrays provide a platform for identifying genes that are 
tissue specific, are affected by a substance being tested in a toxicology assay, are part of a signaling 
cascade, carry out housekeeping functions, or are specifically related to a particular genetic 
predisposition, condition, disease, or disorder. 
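
As an illustration of how an expression profile might be screened for genes affected by a test substance, the following is a minimal sketch only. It assumes that each sample has already been reduced to one signal intensity per gene; the gene names, intensities, pseudocount, and two-fold threshold are hypothetical values chosen for the example and are not taken from this disclosure.

```python
import math


def log2_fold_changes(treated, control, pseudocount=1.0):
    """Return {gene: log2 ratio of treated to control signal}.

    The pseudocount (an assumption of this sketch) avoids division by zero
    for genes with no detectable signal. Only genes present in both
    samples are compared.
    """
    return {
        gene: math.log2((treated[gene] + pseudocount) / (control[gene] + pseudocount))
        for gene in treated
        if gene in control
    }


def differentially_expressed(treated, control, threshold=1.0):
    """Return genes whose absolute log2 fold change meets the threshold.

    A threshold of 1.0 corresponds to a two-fold increase or decrease.
    """
    changes = log2_fold_changes(treated, control)
    return {gene: lfc for gene, lfc in changes.items() if abs(lfc) >= threshold}


# Hypothetical signal intensities, invented for illustration only.
control = {"geneA": 99.0, "geneB": 199.0, "geneC": 49.0}
treated = {"geneA": 399.0, "geneB": 209.0, "geneC": 11.5}

hits = differentially_expressed(treated, control)
for gene, lfc in sorted(hits.items()):
    print(f"{gene}: log2 fold change = {lfc:+.2f}")
```

In this sketch, geneA and geneC would be flagged as up- and down-regulated respectively, while geneB would be treated as unchanged. Practical analyses generally also require normalization across arrays and statistical testing across replicates, which are omitted here.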

Lung cancer is the leading cause of cancer death in the United States, affecting more than 
100,000 men and 50,000 women each year. Nearly 90% of the patients diagnosed with lung 
cancer are cigarette smokers. Tobacco smoke contains thousands of noxious substances that 
induce carcinogen metabolizing enzymes and covalent DNA adduct formation in the exposed 
bronchial epithelium. In nearly 80% of patients diagnosed with lung cancer, metastasis has 
already occurred. Most commonly lung cancers metastasize to pleura, brain, bone, pericardium,
and liver. This adversely affects the overall five-year survival rate which is 37% for squamous 
carcinoma, 27% for adenocarcinoma and large cell carcinoma, and less than 1% for small cell 
carcinomas. Earlier diagnosis and a systematic approach to identification, staging, and
treatment could positively affect patient outcome (DeVita et al. (1997) Cancer: Principles and
Practice of Oncology, Lippincott-Raven, Philadelphia, PA; and Fauci et al. (1998) Harrison's
Principles of Internal Medicine, McGraw Hill, New York, NY).

Lung cancers progress through a series of morphologically distinct stages from
hyperplasia to invasive carcinoma. Malignant lung cancers are divided into two groups 
comprising four histopathological classes. The nonsmall cell lung carcinoma (NSCLC) group 
includes squamous cell carcinomas, adenocarcinomas, and large cell carcinomas and accounts for 
about 70% of all lung cancer cases. Adenocarcinomas typically arise in the peripheral airways 
and often form mucin secreting glands. Squamous cell carcinomas typically arise in proximal 
airways. The histogenesis of squamous cell carcinomas may be related to chronic inflammation
and injury to the bronchial epithelium, leading to squamous metaplasia. The small cell lung
carcinoma (SCLC) group accounts for about 20% of lung cancer cases. SCLCs typically arise in
proximal airways and exhibit a number of paraneoplastic syndromes including inappropriate 
production of adrenocorticotropin and anti-diuretic hormone. 

Lung cancer cells accumulate numerous genetic lesions, many of which are associated 
with cytologically visible chromosomal aberrations. The high frequency of chromosomal 
deletions associated with lung cancer may reflect the role of multiple tumor suppressor loci in the 
etiology of this disease. Several studies report deletions of regions of chromosome 11 in NSCLC 
(Bepler, G. and Garcia-Blanco, M.A. (1994) Proc. Natl. Acad. Sci. USA 91:5513-7; Iizuka, M., et
al. (1995) Genes, Chromosomes & Cancer 13:40-46; Rasio, D. (1995) Cancer Research
55:3988-91). Deletions in other chromosome arms such as 3p, 9p and 17p are also common. Other
frequently observed genetic lesions include overexpression of telomerase, activation of 
oncogenes such as K-ras and c-myc, and inactivation of tumor suppressor genes such as RB, p53 
and p16 (Toomey, D. et al. (2001) Cancer 92:2648-57; Zajac-Kaye, M. (2001) Lung Cancer
34:S43-6; Wright, G. et al. (2000) Current Opinion in Oncology 12:143-8; Kohno, T. and 
Yokota, J. (1999) Carcinogenesis 20:1403-10). 

Prostate cancer is a common malignancy in men over the age of 50, and the incidence 
increases with age. In the US, there are approximately 132,000 newly diagnosed cases of prostate 
cancer and more than 33,000 deaths from the disorder each year. 

Once cancer cells arise in the prostate, they are stimulated by testosterone to a more rapid 
growth. Thus, removal of the testes can indirectly reduce both rapid growth and metastasis of the 
cancer. Over 95 percent of prostatic cancers are adenocarcinomas which originate in the prostatic 
acini. The remaining 5 percent are divided between squamous cell and transitional cell carcinomas, 
both of which arise in the prostatic ducts or other parts of the prostate gland. 

As with most cancers, prostate cancer develops through a multistage progression ultimately 
resulting in an aggressive, metastatic phenotype. The initial step in tumor progression involves the 
hyperproliferation of normal luminal and/or basal epithelial cells that become hyperplastic and evolve
into early-stage tumors. The early-stage tumors are localized in the prostate but eventually may
metastasize, particularly to the bone, brain or lung. About 80% of these tumors remain responsive to 
androgen treatment, an important hormone controlling the growth of prostate epithelial cells. 
However, in its most advanced state, cancer growth becomes androgen-independent and there is 
currently no known treatment for this condition. 

A primary diagnostic marker for prostate cancer is prostate specific antigen (PSA). PSA is a 
tissue-specific serine protease almost exclusively produced by prostatic epithelial cells. The quantity 
of PSA correlates with the number and volume of the prostatic epithelial cells, and consequently, the
levels of PSA are an excellent indicator of abnormal prostate growth. Men with prostate cancer
exhibit an early linear increase in PSA levels followed by an exponential increase prior to diagnosis.
However, since PSA levels are also influenced by factors such as inflammation, androgen and other
growth factors, some scientists maintain that changes in PSA levels are not useful in detecting
individual cases of prostate cancer. 

Current areas of cancer research provide additional prospects for markers as well as potential 
therapeutic targets for prostate cancer. Several growth factors have been shown to play a critical role
in tumor development, growth, and progression. The growth factors Epidermal Growth Factor
(EGF), Fibroblast Growth Factor (FGF), and Transforming Growth Factor alpha (TGFα) are important in the
growth of normal as well as hyperproliferative prostate epithelial cells, particularly at early stages of
tumor development and progression, and affect signaling pathways in these cells in various ways (Lin
J et al. (1999) Cancer Res. 59:2891-2897; Putz T et al. (1999) Cancer Res 59:227-233). The TGF-β
family of growth factors are generally expressed at increased levels in human cancers and the high
expression levels in many cases correlate with advanced stages of malignancy and poor survival
(Gold LI (1999) Crit Rev Oncog 10:303-360). Finally, there are human cell lines representing both
the androgen-dependent stage of prostate cancer (LNCaP) as well as the androgen-independent,
hormone refractory stage of the disease (PC3 and DU-145) that have proved useful in studying gene 
expression patterns associated with the progression of prostate cancer, and the effects of cell 
treatments on these expressed genes (Chung TD (1999) Prostate 15:199-207). 

Ovarian cancer is the leading cause of death from a gynecologic cancer. The majority of 
ovarian cancers are derived from epithelial cells, and 70% of patients with epithelial ovarian cancers 
present with late-stage disease. As a result the long-term survival rates for this disease are very low.
Identification of early stage markers for ovarian cancer would significantly increase the survival rate. 
The molecular events that lead to ovarian cancer are poorly understood. Some of the known 
aberrations include mutation of p53 and microsatellite instability.

Osteosarcoma is the most common
malignant bone tumor in children. With currently available treatment regimens, approximately
30-40% of patients with non-metastatic disease relapse after therapy. Currently, there is no
prognostic factor that can be used at the time of initial diagnosis to predict which patients will have a 
high risk of relapse. The only significant prognostic factor predicting the outcome in a patient with 
non-metastatic osteosarcoma is the histopathologic response of the primary tumor resected at the time 
of definitive surgery. 

There are more than 180,000 new cases of breast cancer diagnosed each year, and the 
mortality rate for breast cancer approaches 10% of all deaths in females between the ages of 45-54
(K. Gish (1999) AWIS Magazine 28:7-10). However, the survival rate based on early diagnosis of
localized breast cancer is extremely high (97%), compared with the advanced stage of the disease in 
which the tumor has spread beyond the breast (22%). Current procedures for clinical breast 
examination are lacking in sensitivity and specificity, and efforts are underway to develop
comprehensive gene expression profiles for breast cancer that may be used in conjunction with 
conventional screening methods to improve diagnosis and prognosis of this disease (Perou CM et al. 
(2000) Nature 406:747-752). 

Breast cancer is a genetic disease commonly caused by mutations in cellular genes.
Mutations in two genes, BRCA1 and BRCA2, are known to greatly predispose a woman to breast
cancer and may be passed on from parents to children (Gish, supra). However, this type of hereditary
breast cancer accounts for only about 5% to 9% of breast cancers, while the vast majority of breast 
cancer is due to noninherited mutations that occur in breast epithelial cells. 

A good deal is already known about the expression of specific genes associated with breast 
cancer. For example, the relationship between expression of epidermal growth factor (EGF) and its
receptor, EGFR, to human mammary carcinoma has been particularly well studied. (See Khazaie et 
al., supra, and references cited therein for a review of this area.) Overexpression of EGFR, 
particularly coupled with down-regulation of the estrogen receptor, is a marker of poor prognosis in 
breast cancer patients. In addition, EGFR expression in breast tumor metastases is frequently
elevated relative to the primary tumor, suggesting that EGFR is involved in tumor progression and
metastasis. This is supported by accumulating evidence that EGF has effects on cell functions related 
to metastatic potential, such as cell motility, chemotaxis, secretion and differentiation. Changes in 
expression of other members of the erbB receptor family, of which EGFR is one, have also been 
implicated in breast cancer. The abundance of erbB receptors, such as HER-2/neu, HER-3, and HER-4,
and their ligands in breast cancer points to their functional importance in the pathogenesis of the
disease, and may therefore provide targets for therapy of the disease (Bacus, SS et al. (1994) Am J 
Clin Pathol 102:S13-S24). Other known markers of breast cancer include a human secreted frizzled 
protein mRNA that is downregulated in breast tumors; the matrix Gla protein which is overexpressed
in human breast carcinoma cells; Drg1 or RTP, a gene whose expression is diminished in colon,
breast, and prostate tumors; maspin, a tumor suppressor gene downregulated in invasive breast
carcinomas; and CaN19, a member of the S100 protein family, all of which are down regulated in
mammary carcinoma cells relative to normal mammary epithelial cells (Zhou Z et al. (1998) Int J
Cancer 78:95-99; Chen, L et al. (1990) Oncogene 5:1391-1395; Ulrix W et al. (1999) FEBS Lett
455:23-26; Sager, R et al. (1996) Curr Top Microbiol Immunol 213:51-64; and Lee, SW et al. (1992) 
Proc Natl Acad Sci USA 89:2504-2508). 

Cell lines derived from human mammary epithelial cells at various stages of breast cancer 
provide a useful model to study the process of malignant transformation and tumor progression as it 
has been shown that these cell lines retain many of the properties of their parental tumors for lengthy 
culture periods (Wistuba II et al. (1998) Clin Cancer Res 4:2931-2938). Such a model is particularly
useful for comparing phenotypic and molecular characteristics of human mammary epithelial cells at 
various stages of malignant transformation. 

Colon cancer evolves through a multi-step process whereby pre-malignant colonocytes 
undergo a relatively defined sequence of events leading to tumor formation. While soft tissue 
sarcomas are relatively rare, more than 50% of new patients diagnosed with the disease will die from 
it. The molecular pathways leading to the development of sarcomas are relatively unknown, due to 
the rarity of the disease and variation in pathology. Several factors participate in the process of tumor 
progression and malignant transformation including genetic factors, mutations, and selection.

To understand the nature of gene alterations in colorectal cancer, a number of studies have 
focused on the inherited syndromes. Familial adenomatous polyposis (FAP) is caused by mutations
in the adenomatous polyposis coli gene (APC), resulting in truncated or inactive forms of the protein.
This tumor suppressor gene has been mapped to chromosome 5q. Hereditary nonpolyposis colorectal 
cancer (HNPCC) is caused by mutations in mis-match repair genes. Although hereditary colon cancer 
syndromes occur in a small percentage of the population and most colorectal cancers are considered 
sporadic, knowledge from studies of the hereditary syndromes can be generally applied. For instance, 
somatic mutations in APC occur in at least 80% of sporadic colon tumors. APC mutations are 
thought to be the initiating event in the disease. Other mutations occur subsequently. Approximately 
50% of colorectal cancers contain activating mutations in ras, while 85% contain inactivating
mutations in p53. Changes in all of these genes lead to gene expression changes in colon cancer. 

Steroids are a class of lipid-soluble molecules, including cholesterol, bile acids, vitamin D, 
and hormones, that share a common four-ring structure based on cyclopentano-perhydrophenanthrene 
and that carry out a wide variety of functions. Cholesterol, for example, is a component of cell
membranes that controls membrane fluidity. It is also a precursor for bile acids which solubilize 
lipids and facilitate absorption in the small intestine during digestion. Vitamin D regulates the 
absorption of calcium in the small intestine and controls the concentration of calcium in plasma. 
Steroid hormones, produced by the adrenal cortex, ovaries, and testes, include glucocorticoids,
mineralocorticoids, androgens, and estrogens. They control various biological processes by binding
to intracellular receptors that regulate transcription of specific genes in the nucleus. Glucocorticoids, 
for example, increase blood glucose concentrations by regulation of gluconeogenesis in the liver, 
increase blood concentrations of fatty acids by promoting lipolysis in adipose tissues, modulate 
sensitivity to catecholamines in the central nervous system, and reduce inflammation. The principal
mineralocorticoid, aldosterone, is produced by the adrenal cortex and acts on cells of the distal 
tubules of the kidney to enhance sodium ion reabsorption. Androgens, produced by the interstitial 
cells of Leydig in the testis, include the male sex hormone testosterone, which triggers changes at 
puberty, the production of sperm and maintenance of secondary sexual characteristics. Female sex 
hormones, estrogen and progesterone, are produced by the ovaries and also by the placenta and 
adrenal cortex of the fetus during pregnancy. Estrogen regulates female reproductive processes and 
secondary sexual characteristics. Progesterone regulates changes in the endometrium during the 
menstrual cycle and pregnancy. 

Steroid hormones are widely used for fertility control and in anti-inflammatory treatments for 
physical injuries and diseases such as arthritis, asthma, and auto-immune disorders. Progesterone, a 
naturally occurring progestin, is primarily used to treat amenorrhea, abnormal uterine bleeding, or as 
a contraceptive. Endogenous progesterone is responsible for inducing secretory activity in the 
endometrium of the estrogen-primed uterus in preparation for the implantation of a fertilized egg and 
for the maintenance of pregnancy. It is secreted from the corpus luteum in response to luteinizing 
hormone (LH). The primary contraceptive effect of exogenous progestins involves the suppression
of the midcycle surge of LH. At the cellular level, progestins diffuse freely into target cells and bind 
to the progesterone receptor. Target cells include the female reproductive tract, the mammary gland, 
the hypothalamus, and the pituitary. Once bound to the receptor, progestins slow the frequency of
release of gonadotropin releasing hormone from the hypothalamus and blunt the pre-ovulatory LH 
surge, thereby preventing follicular maturation and ovulation. Progesterone has minimal estrogenic 
and androgenic activity. Progesterone is metabolized hepatically to pregnanediol and conjugated with 
glucuronic acid. 

Corticosteroids are used to relieve inflammation and to suppress the immune response. They
inhibit eosinophil, basophil, and airway epithelial cell function by regulation of cytokines that 
mediate the inflammatory response. They inhibit leukocyte infiltration at the site of inflammation, 
interfere in the function of mediators of the inflammatory response, and suppress the humoral 
immune response. Corticosteroids are used to treat allergies, asthma, arthritis, and skin conditions. 
Budesonide is a corticosteroid used to control symptoms associated with allergic rhinitis or asthma.
Budesonide has high topical anti-inflammatory activity but low systemic activity.




The most important function of adipose tissue is its ability to store and release fat during
periods of feeding and fasting. White adipose tissue is the major energy reserve in periods of excess 
energy use. Its primary purpose is mobilization during energy deprivation. Understanding how 
various molecules regulate adiposity and energy balance in physiological and pathophysiological 
situations may lead to the development of novel therapeutics for human obesity. Adipose tissue is 
also one of the important target tissues for insulin. Adipogenesis and insulin resistance in type II
diabetes are linked and present intriguing relations. Most patients with type II diabetes are obese and
obesity in turn causes insulin resistance. 

The majority of research in adipocyte biology to date has been done using transformed mouse 
preadipocyte cell lines. The culture condition which stimulates mouse preadipocyte differentiation is 
different from that for inducing human primary preadipocyte differentiation. In addition, primary
cells are diploid and may therefore reflect the in vivo context better than aneuploid cell lines. 
Understanding the gene expression profile during adipogenesis in humans will lead to understanding 
the fundamental mechanism of adiposity regulation. Furthermore, through comparing the gene 
expression profiles of adipogenesis between donors of normal weight and donors with obesity,
identification of crucial genes, potential drug targets for obesity and type II diabetes, will be possible. 

Insulin sensitivity can be enhanced by various compounds. Thiazolidinediones (TZDs) act as
agonists for the peroxisome-proliferator-activated receptor gamma (PPARγ), a member of the nuclear
hormone receptor superfamily. TZDs reduce hyperglycemia, hyperinsulinemia, and hypertension, in
part by promoting glucose metabolism and inhibiting gluconeogenesis. Roles for PPARγ and its
agonists have been demonstrated in a wide range of pathological conditions including diabetes, 
obesity, hypertension, atherosclerosis, polycystic ovarian syndrome, and cancers such as breast, 
prostate, liposarcoma, and colon cancer. 

The mechanism by which TZDs and other PPARγ agonists enhance insulin sensitivity is not
fully understood, but may involve the ability of PPARγ to promote adipogenesis. When ectopically
expressed in cultured preadipocytes, PPARγ is a potent inducer of adipocyte differentiation. TZDs,
in combination with insulin and other factors, can also enhance differentiation of human 
preadipocytes in culture (Adams et al. (1997) J. Clin. Invest. 100:3149-3153). The relative potency 
of different TZDs in promoting adipogenesis in vitro is proportional to both their insulin sensitizing 
effects in vivo, and their ability to bind and activate PPARγ in vitro. Interestingly, adipocytes derived
from omental adipose depots are refractory to the effects of TZDs. It has therefore been suggested 
that the insulin sensitizing effects of TZDs may result from their ability to promote adipogenesis in 
subcutaneous adipose depots (Adams et al., supra). Further, dominant negative mutations in the
PPARγ gene have been identified in two non-obese subjects with severe insulin resistance,
hypertension, and overt non-insulin dependent diabetes mellitus (NIDDM) (Barroso et al. (1998)
Nature 402:880-883). 

NIDDM is the most common form of diabetes mellitus, a chronic metabolic disease that
affects 143 million people worldwide. NIDDM is characterized by abnormal glucose and lipid 
metabolism that result from a combination of peripheral insulin resistance and defective insulin 
secretion. NIDDM has a complex, progressive etiology and a high degree of heritability. Numerous
complications of diabetes including heart disease, stroke, renal failure, retinopathy, and peripheral 
neuropathy contribute to the high rate of morbidity and mortality. 

Tangier disease (TD) is a genetic disorder characterized by near absence of circulating high 
density lipoprotein (HDL) and the accumulation of cholesterol esters in many tissues, including 
tonsils, lymph nodes, liver, spleen, thymus, and intestine. Low levels of HDL represent a clear 
predictor of premature coronary artery disease and homozygous TD correlates with a four- to six-fold 
increase in cardiovascular disease compared to controls. HDL plays a cardio-protective role in 
reverse cholesterol transport, the flux of cholesterol from peripheral cells such as tissue macrophages 
through plasma lipoproteins to the liver. The HDL protein, apolipoprotein A-I, plays a major role in 
this process, interacting with the cell surface to remove excess cholesterol and phospholipids. This 
pathway is severely impaired in TD and the defect lies in a specific gene, the ABC1 transporter. This
gene is a member of the family of ATP-binding cassette transporters, which utilize ATP hydrolysis to 
transport a variety of substrates across membranes. 

Leukocytes comprise lymphocytes, granulocytes, and monocytes. Lymphocytes include T- 
and B-cells, which specifically recognize and respond to foreign pathogens. T-cells fight viral 
infections and activate other leukocytes, while B-cells secrete antibodies that neutralize bacteria and 
other microbes. Granulocytes and monocytes are primarily migratory, phagocytic cells that exit the 
bloodstream to fight infection in tissues. Monocytes, which are derived from immature 
promonocytes, further differentiate into macrophages that engulf and digest microorganisms and 
damaged or dead cells. Monocytes and macrophages modulate the immune response by secreting
signaling molecules such as growth factors and cytokines. Tumor necrosis factor-α (TNF-α), for
example, is a macrophage-secreted protein with anti-tumor and anti-viral activity. In addition, 
monocytes and macrophages are recruited to sites of infection and inflammation by signaling proteins 
secreted by other leukocytes. The differentiation of the monocyte blood cell lineage can be studied m 
vitro using cultured cell lines. For example, THP-1 is a human promonocyte cell line that can be 
activated by treatment with both phorbol ester such as phorbol myristate acetate (PMA), and 
lipopolysaccharide (LPS). PMA is a broad activator of the protein kinase C-dependent pathways. 

Monocytes are involved in the initiation and maintenance of inflammatory immune 
responses. The outer membrane of gram-negative bacteria expresses lipopolysaccharide (LPS) 
complexes called endotoxins. Toxicity is associated with the lipid component (lipid A) of LPS, and 
immunogenicity is associated with the polysaccharide components of LPS. LPS elicits a variety of 
inflammatory responses, and because it activates complement by the alternative (properdin) pathway, 
it is often part of the pathology of gram-negative bacterial infections. For the most part, endotoxins 
remain associated with the cell wall until the bacteria disintegrate. LPS released into the bloodstream 
by lysing gram-negative bacteria is first bound by certain plasma proteins identified as LPS-binding 
proteins. The LPS-binding protein complex interacts with CD14 receptors on monocytes, 
macrophages, B cells, and other types of receptors on endothelial cells. Activation of human B cells 
with LPS results in mitogenesis as well as immunoglobulin synthesis. In monocytes and 
macrophages three types of events are triggered during their interaction with LPS: 1) Production of 
cytokines, including IL-1, IL-6, IL-8, TNF-α, and platelet-activating factor, which stimulate 
production of prostaglandins and leukotrienes that mediate inflammation and septic shock; 2) 
Activation of the complement cascade; and 3) Activation of the coagulation cascade. 

Osteoarthritis (OA) is a debilitating joint disease involving focal cartilage loss. Several 
studies indicate a major genetic component can be involved in causing OA. Estimates of 
inheritability from twin studies of radiographic OA of the hand, knee and hip range from 36% to 68% 
(MacGregor, A.J. and Spector, T.D. (1999) Rheumatology 38:583-560). Several interleukin and 
interleukin-associated genes are located at 2q12-q22 (Leppavouri, J. et al. (1999) Am. J. Hum. Genet. 
65:1060-1067). Interleukins regulate a number of enzymes that degrade the cartilage extracellular 
matrix, and the expression of certain interleukin genes, including IL-1β, is altered in OA joint tissue 
(Elson, C.J. et al. (1998) Br. J. Rheum. 37:106-107). 

There is a need in the art for new compositions, including nucleic acids and proteins, for the 
diagnosis, prevention, and treatment of gastrointestinal, cardiovascular, autoimmune/inflammatory, 
cell proliferative, developmental, epithelial, neurological, and reproductive disorders. 

SUMMARY OF THE INVENTION 

Various embodiments of the invention provide purified polypeptides, protein modification 
and maintenance molecules, referred to collectively as 'PMMM' and individually as 'PMMM-1,' 
'PMMM-2,' 'PMMM-3,' 'PMMM-4,' 'PMMM-5,' 'PMMM-6,' 'PMMM-7,' 'PMMM-8,' 'PMMM-9,' 
'PMMM-10,' 'PMMM-11,' 'PMMM-12,' 'PMMM-13,' 'PMMM-14,' 'PMMM-15,' 'PMMM-16,' 
'PMMM-17,' 'PMMM-18,' 'PMMM-19,' 'PMMM-20,' 'PMMM-21,' 'PMMM-22,' 'PMMM-23,' 
'PMMM-24,' 'PMMM-25,' 'PMMM-26,' 'PMMM-27,' 'PMMM-28,' 'PMMM-29,' 'PMMM-30,' 
'PMMM-31,' 'PMMM-32,' 'PMMM-33,' 'PMMM-34,' 'PMMM-35,' 'PMMM-36,' 'PMMM-37,' 
'PMMM-38,' 'PMMM-39,' 'PMMM-40,' 'PMMM-41,' 'PMMM-42,' 'PMMM-43,' 'PMMM-44,' 
'PMMM-45,' 'PMMM-46,' 'PMMM-47,' 'PMMM-48,' 'PMMM-49,' 'PMMM-50,' 'PMMM-51,' 
'PMMM-52,' 'PMMM-53,' 'PMMM-54,' 'PMMM-55,' 'PMMM-56,' 'PMMM-57,' and 
'PMMM-58,' and methods for using these proteins and their encoding polynucleotides for the 
detection, diagnosis, and treatment of diseases and medical conditions. Embodiments also provide 
methods for utilizing the purified protein modification and maintenance molecules and/or their 
encoding polynucleotides for facilitating the drug discovery process, including determination of 
efficacy, dosage, toxicity, and pharmacology. Related embodiments provide methods for utilizing the 
purified protein modification and maintenance molecules and/or their encoding polynucleotides for 
investigating the pathogenesis of diseases and medical conditions. 

An embodiment provides an isolated polypeptide selected from the group consisting of a) a 
polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NO: 1- 
58, b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical or at 
least about 90% identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-58, c) a biologically active fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-58, and d) an immunogenic fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-58. Another 
embodiment provides an isolated polypeptide comprising an amino acid sequence of SEQ ID 
NO: 1-58. 
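
The "at least 90% identical" criterion recited above can be illustrated with a minimal sketch. It assumes that the two amino acid sequences have already been aligned to equal length, with "-" marking gaps, and that identity is counted as identical residues divided by the number of aligned columns. That counting convention, the function names, and the example sequences are assumptions made for illustration only and are not taken from this specification.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding the same residue in both sequences.

    Assumes both inputs are pre-aligned to equal length; '-' marks a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to equal length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    # A column counts as a match only if both residues agree and are not gaps.
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 90.0) -> bool:
    """True if the aligned pair is at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical ten-residue sequences differing at one position.
reference = "MKTAYIAKQR"
candidate = "MKTAYIAKQK"
print(percent_identity(reference, candidate))
print(meets_identity_threshold(reference, candidate))
```

Other conventions, such as dividing by the length of the shorter sequence or excluding gapped columns, give different values for the same pair, so the convention should be fixed before any threshold is applied.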

Still another embodiment provides an isolated polynucleotide encoding a polypeptide 
selected from the group consisting of a) a polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-58, b) a polypeptide comprising a naturally occurring 
amino acid sequence at least 90% identical or at least about 90% identical to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-58, c) a biologically active fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-58, 
and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-58. In another embodiment, the polynucleotide encodes a 
polypeptide selected from the group consisting of SEQ ID NO: 1-58. In an alternative embodiment, 
the polynucleotide is selected from the group consisting of SEQ ID NO:59-116. 

Still another embodiment provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-58, b) a polypeptide comprising a naturally occurring amino acid sequence at least 
90% identical or at least about 90% identical to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-58, c) a biologically active fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-58, and d) an immunogenic 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-58. Another embodiment provides a cell transformed with the recombinant polynucleotide. 
Yet another embodiment provides a transgenic organism comprising the recombinant polynucleotide. 

Another embodiment provides a method for producing a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-58, b) a polypeptide comprising a naturally occurring amino acid sequence at least 
90% identical or at least about 90% identical to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-58, c) a biologically active fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-58, and d) an immunogenic 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-58. The method comprises a) culturing a cell under conditions suitable for expression of the 
polypeptide, wherein said cell is transformed with a recombinant polynucleotide comprising a 
promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) recovering 
the polypeptide so expressed. 

Yet another embodiment provides an isolated antibody which specifically binds to a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-58, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-58, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-58, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-58. 

Still yet another embodiment provides an isolated polynucleotide selected from the group 
consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:59-116, b) a polynucleotide comprising a naturally occurring 
polynucleotide sequence at least 90% identical or at least about 90% identical to a polynucleotide 
sequence selected from the group consisting of SEQ ID NO:59-116, c) a polynucleotide 
complementary to the polynucleotide of a), d) a polynucleotide complementary to the polynucleotide 
of b), and e) an RNA equivalent of a)-d). In other embodiments, the polynucleotide can comprise at 
least about 20, 30, 40, 60, 80, or 100 contiguous nucleotides. 

Yet another embodiment provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide being selected from the group consisting of a) a polynucleotide comprising 
a polynucleotide sequence selected from the group consisting of SEQ ID NO:59-116, b) a 
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical or at 
least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:59-116, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide 
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method 
comprises a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide or fragments thereof, and b) 
detecting the presence or absence of said hybridization complex. In a related embodiment, the 
method can include detecting the amount of the hybridization complex. In still other embodiments, 
the probe can comprise at least about 20, 30, 40, 60, 80, or 100 contiguous nucleotides. 

Still yet another embodiment provides a method for detecting a target polynucleotide in a 
sample, said target polynucleotide being selected from the group consisting of a) a polynucleotide 
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:59-116, b) a 
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical or at 
least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:59-116, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide 
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method 
comprises a) amplifying said target polynucleotide or fragment thereof using polymerase chain 
reaction amplification, and b) detecting the presence or absence of said amplified target 
polynucleotide or fragment thereof. In a related embodiment, the method can include detecting the 
amount of the amplified target polynucleotide or fragment thereof. 

Another embodiment provides a composition comprising an effective amount of a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-58, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-58, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-58, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-58, and a pharmaceutically acceptable excipient. 
In one embodiment, the composition can comprise an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-58. Other embodiments provide a method of treating a disease or 
condition associated with decreased or abnormal expression of functional PMMM, comprising 
administering to a patient in need of such treatment the composition. 

Yet another embodiment provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-58, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-58, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-58, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-58. The method comprises a) exposing a sample 
comprising the polypeptide to a compound, and b) detecting agonist activity in the sample. Another 
embodiment provides a composition comprising an agonist compound identified by the method and a 
pharmaceutically acceptable excipient. Yet another embodiment provides a method of treating a 
disease or condition associated with decreased expression of functional PMMM, comprising 
administering to a patient in need of such treatment the composition. 

Still yet another embodiment provides a method for screening a compound for effectiveness 
as an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-58, b) a polypeptide 
comprising a naturally occurring amino acid sequence at least 90% identical or at least about 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-58, c) a 
biologically active fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-58, and d) an immunogenic fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-58. The method comprises a) 
exposing a sample comprising the polypeptide to a compound, and b) detecting antagonist activity in 
the sample. Another embodiment provides a composition comprising an antagonist compound 
identified by the method and a pharmaceutically acceptable excipient. Yet another embodiment 
provides a method of treating a disease or condition associated with overexpression of functional 
PMMM, comprising administering to a patient in need of such treatment the composition. 

Another embodiment provides a method of screening for a compound that specifically binds 
to a polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-58, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-58, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-58, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-58. The method comprises a) combining the 
polypeptide with at least one test compound under suitable conditions, and b) detecting binding of the 
polypeptide to the test compound, thereby identifying a compound that specifically binds to the 
polypeptide. 

Yet another embodiment provides a method of screening for a compound that modulates the 
activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-58, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-58, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-58, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-58. The method comprises a) combining the 
polypeptide with at least one test compound under conditions permissive for the activity of the 
polypeptide, b) assessing the activity of the polypeptide in the presence of the test compound, and c) 
comparing the activity of the polypeptide in the presence of the test compound with the activity of the 
polypeptide in the absence of the test compound, wherein a change in the activity of the polypeptide 
in the presence of the test compound is indicative of a compound that modulates the activity of the 
polypeptide. 
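
The comparison in step c) can be sketched as follows. This is an illustrative outline only: it assumes that activity is measured as replicate numeric readings in arbitrary units, and it uses a hypothetical cutoff of a 20% relative change to call a compound a modulator. The function names, the cutoff, and the readings are assumptions and are not specified by this disclosure.

```python
from statistics import mean


def relative_change(activity_with: list[float], activity_without: list[float]) -> float:
    """Fractional change in mean activity caused by the test compound."""
    baseline = mean(activity_without)
    if baseline == 0:
        raise ValueError("baseline activity must be non-zero")
    return (mean(activity_with) - baseline) / baseline


def classify_compound(activity_with: list[float], activity_without: list[float],
                      cutoff: float = 0.2) -> str:
    """Label a compound by comparing activity with and without it.

    `cutoff` is a hypothetical threshold on the fractional change.
    """
    change = relative_change(activity_with, activity_without)
    if change >= cutoff:
        return "increases activity"
    if change <= -cutoff:
        return "decreases activity"
    return "no change detected"


# Hypothetical replicate readings (arbitrary units).
print(classify_compound([150.0, 150.0], [100.0, 100.0]))
print(classify_compound([50.0], [100.0]))
print(classify_compound([105.0], [100.0]))
```

In practice a statistical test across replicates would normally replace the fixed cutoff; the fixed value is used here only to keep the sketch short.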

Still yet another embodiment provides a method for screening a compound for effectiveness 
in altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:59-116, the method 
comprising a) exposing a sample comprising the target polynucleotide to a compound, b) detecting 
altered expression of the target polynucleotide, and c) comparing the expression of the target 
polynucleotide in the presence of varying amounts of the compound and in the absence of the 
compound. 

Another embodiment provides a method for assessing toxicity of a test compound, said 
method comprising a) treating a biological sample containing nucleic acids with the test compound; 
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide 
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:59-116, ii) a 
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical or at 
least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:59-116, iii) a polynucleotide having a sequence complementary to i), iv) a polynucleotide 
complementary to the polynucleotide of ii), and v) an RNA equivalent of i)-iv). Hybridization occurs 
under conditions whereby a specific hybridization complex is formed between said probe and a target 
polynucleotide in the biological sample, said target polynucleotide selected from the group consisting 
of i) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:59-116, ii) a polynucleotide comprising a naturally occurring polynucleotide sequence at 
least 90% identical or at least about 90% identical to a polynucleotide sequence selected from the 
group consisting of SEQ ID NO:59-116, iii) a polynucleotide complementary to the polynucleotide of 
i), iv) a polynucleotide complementary to the polynucleotide of ii), and v) an RNA equivalent of i)- 
iv). Alternatively, the target polynucleotide can comprise a fragment of a polynucleotide selected 
from the group consisting of i)-v) above; c) quantifying the amount of hybridization complex; and d) 
comparing the amount of hybridization complex in the treated biological sample with the amount of 
hybridization complex in an untreated biological sample, wherein a difference in the amount of 
hybridization complex in the treated biological sample is indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 summarizes the nomenclature for full length polynucleotide and polypeptide 
embodiments of the invention. 

Table 2 shows the GenBank identification number and annotation of the nearest GenBank 
homolog, and the PROTEOME database identification numbers and annotations of PROTEOME 
database homologs, for polypeptide embodiments of the invention. The probability scores for the 
matches between each polypeptide and its homolog(s) are also shown. 

Table 3 shows structural features of polypeptide embodiments, including predicted motifs 
and domains, along with the methods, algorithms, and searchable databases used for analysis of the 
polypeptides. 

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble 
polynucleotide embodiments, along with selected fragments of the polynucleotides. 

Table 5 shows representative cDNA libraries for polynucleotide embodiments. 

Table 6 provides an appendix which describes the tissues and vectors used for construction of 
the cDNA libraries shown in Table 5. 

Table 7 shows the tools, programs, and algorithms used to analyze polynucleotides and 
polypeptides, along with applicable descriptions, references, and threshold parameters. 

Table 8 shows single nucleotide polymorphisms found in polynucleotide sequences of the 
invention, along with allele frequencies in different human populations. 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleic acids, and methods are described, it is understood that 
embodiments of the invention are not limited to the particular machines, instruments, materials, and 
methods described, as these may vary. It is also to be understood that the terminology used herein is 
for the purpose of describing particular embodiments only, and is not intended to limit the scope of 
the invention. 

As used herein and in the appended claims, the singular forms "a," "an," and "the" include 
plural reference unless the context clearly dictates otherwise. Thus, for example, a reference to "a 
host cell" includes a plurality of such host cells, and a reference to "an antibody" is a reference to one 
or more antibodies and equivalents thereof known to those skilled in the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be 
used to practice or test the present invention, the preferred machines, materials and methods are now 
described. All publications mentioned herein are cited for the purpose of describing and disclosing 
the cell lines, protocols, reagents and vectors which are reported in the publications and which might 
be used in connection with various embodiments of the invention. Nothing herein is to be construed 
as an admission that the invention is not entitled to antedate such disclosure by virtue of prior 
invention. 
DEFINITIONS 

"PMMM" refers to the amino acid sequences of substantially purified PMMM obtained from 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
PMMM. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of PMMM either by directly interacting with 
PMMM or by acting on components of the biological pathway in which PMMM participates. 

An "allelic variant" is an alternative form of the gene encoding PMMM. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence. 

"Altered" nucleic acid sequences encoding PMMM include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as PMMM or 
a polypeptide with at least one functional characteristic of PMMM. Included within this definition 
are polymorphisms which may or may not be readily detectable using a particular oligonucleotide 
probe of the polynucleotide encoding PMMM, and improper or unexpected hybridization to allelic 
variants, with a locus other than the normal chromosomal locus for the polynucleotide encoding 
PMMM. The encoded protein may also be "altered" and may contain deletions, insertions, or 
substitutions of amino acid residues which produce a silent change and result in a functionally 
equivalent PMMM. Deliberate amino acid substitutions may be made on the basis of one or more 
similarities in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues, as long as the biological or immunological activity of PMMM is retained. For 
example, negatively charged amino acids may include aspartic acid and glutamic acid, and positively 
charged amino acids may include lysine and arginine. Amino acids with uncharged polar side chains 
having similar hydrophilicity values may include: asparagine and glutamine; and serine and 
threonine. Amino acids with uncharged side chains having similar hydrophilicity values may include: 
leucine, isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine. 
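
The substitution groups listed above can be expressed as a simple lookup. The following sketch uses standard one-letter amino acid codes. The helper name and the treatment of an unchanged residue as conservative are assumptions made for illustration, and membership in a group does not by itself establish that biological or immunological activity is retained.

```python
# Groups taken from the paragraph above, written with one-letter residue codes.
CONSERVATIVE_GROUPS = [
    {"D", "E"},       # negatively charged: aspartic acid, glutamic acid
    {"K", "R"},       # positively charged: lysine, arginine
    {"N", "Q"},       # uncharged polar: asparagine, glutamine
    {"S", "T"},       # uncharged polar: serine, threonine
    {"L", "I", "V"},  # uncharged: leucine, isoleucine, valine
    {"G", "A"},       # uncharged: glycine, alanine
    {"F", "Y"},       # uncharged: phenylalanine, tyrosine
]


def is_conservative(original: str, replacement: str) -> bool:
    """True if the residues are identical or fall within the same listed group."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return True
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


print(is_conservative("D", "E"))  # aspartic acid -> glutamic acid
print(is_conservative("D", "K"))  # aspartic acid -> lysine
```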

The terms "amino acid" and "amino acid sequence" can refer to an oligopeptide, a peptide, a 
polypeptide, or a protein sequence, or a fragment of any of these, and to naturally occurring or 
synthetic molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally 
occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino 
acid sequence to the complete native amino acid sequence associated with the recited protein 
molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid. 
Amplification may be carried out using polymerase chain reaction (PCR) technologies or other 
nucleic acid amplification technologies well known in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
of PMMM. Antagonists may include proteins such as antibodies, anticalins, nucleic acids, 
carbohydrates, small molecules, or any other compound or composition which modulates the activity 
of PMMM either by directly interacting with PMMM or by acting on components of the biological 
pathway in which PMMM participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments 
thereof, such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic determinant. 
Antibodies that bind PMMM polypeptides can be prepared using intact polypeptides or using 
fragments containing small peptides of interest as the immunizing antigen. The polypeptide or 
oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the 
translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. 
Commonly used carriers that are chemically coupled to peptides include bovine serum albumin, 
thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize 
the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (particular regions or three-dimensional structures 
on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the immune response) for binding to an antibody. 

The term "aptamer" refers to a nucleic acid or oligonucleotide molecule that binds to a 
specific molecular target. Aptamers are derived from an in vitro evolutionary process (e.g., SELEX 
(Systematic Evolution of Ligands by Exponential Enrichment), described in U.S. Patent No. 
5,270,163), which selects for target-specific aptamer sequences from large combinatorial libraries. 
Aptamer compositions may be double-stranded or single-stranded, and may include 
deoxyribonucleotides, ribonucleotides, nucleotide derivatives, or other nucleotide-like molecules. 
The nucleotide components of an aptamer may have modified sugar groups (e.g., the 2'-OH group of a 
ribonucleotide may be replaced by 2'-F or 2'-NH2), which may improve a desired property, e.g., 
resistance to nucleases or longer lifetime in blood. Aptamers may be conjugated to other molecules, 
e.g., a high molecular weight carrier to slow clearance of the aptamer from the circulatory system. 
Aptamers may be specifically cross-linked to their cognate ligands, e.g., by photo-activation of a 
cross-linker (Brody, E.N. and L. Gold (2000) J. Biotechnol. 74:5-13). 

The term "intramer" refers to an aptamer which is expressed in vivo. For example, a vaccinia 
virus-based RNA expression system has been used to express specific RNA aptamers at high levels in 
the cytoplasm of leukocytes (Blind, M. et al. (1999) Proc. Natl. Acad. Sci. USA 96:3606-3610). 

The term "spiegelmer" refers to an aptamer which includes L-DNA, L-RNA, or other left- 
handed nucleotide derivatives or nucleotide-like molecules. Aptamers containing left-handed 
nucleotides are resistant to degradation by naturally occurring enzymes, which normally act on 
substrates containing right-handed nucleotides. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a polynucleotide having a specific nucleic acid sequence. Antisense compositions 
may include DNA; RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone 
linkages such as phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides 
having modified sugar groups such as 2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or 
oligonucleotides having modified bases such as 5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'- 
deoxyguanosine. Antisense molecules may be produced by any method including chemical synthesis 
or transcription. Once introduced into a cell, the complementary antisense molecule base-pairs with a 
naturally occurring nucleic acid sequence produced by the cell to form duplexes which block either 
transcription or translation. The designation "negative" or "minus" can refer to the antisense strand, 
and the designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic PMMM, or of any oligopeptide 
thereof, to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5'-AGT-3' pairs with its complement, 
3'-TCA-5'. 
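
The pairing rule in the example above can be sketched in a few lines of Python. This is an illustrative sketch only; the function names are assumptions and are not part of the specification.

```python
# Watson-Crick pairing rules for DNA bases.
PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}

def complement(seq: str) -> str:
    """Base-by-base complement, read in the same left-to-right order as the input."""
    return "".join(PAIR[base] for base in seq.upper())

def reverse_complement(seq: str) -> str:
    """The complementary strand rewritten in the conventional 5'->3' direction."""
    return complement(seq)[::-1]

# For the input 5'-AGT-3', complement() gives the partner strand written 3'->5',
# and reverse_complement() gives that same strand written 5'->3'.
print(complement("AGT"))
print(reverse_complement("AGT"))
```

Note that the complement of 5'-AGT-3' written 3'->5' and its reverse complement written 5'->3' describe the same physical strand.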

A "composition comprising a given polynucleotide" and a "composition comprising a given 
polypeptide" can refer to any composition containing the given polynucleotide or polypeptide. The 
composition may comprise a dry formulation or an aqueous solution. Compositions comprising 
polynucleotides encoding PMMM or fragments of PMMM may be employed as hybridization probes. 
The probes may be stored in freeze-dried form and may be associated with a stabilizing agent such as 
a carbohydrate. In hybridizations, the probe may be deployed in an aqueous solution containing salts 
(e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; SDS), and other components (e.g., Denhardt's 
solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied 
Biosystems, Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been 
assembled from one or more overlapping cDNA, EST, or genomic DNA fragments using a computer 
program for fragment assembly, such as the GELVIEW fragment assembly system (Accelrys, 
Burlington MA) or Phrap (University of Washington, Seattle WA). Some sequences have been both 
extended and assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of 
the protein is conserved and not significantly changed by such substitutions. The table below shows 
amino acids which may be substituted for an original amino acid in a protein and which are regarded 
as conservative amino acid substitutions. 

Original Residue        Conservative Substitution 
Ala                     Gly, Ser 
Arg                     His, Lys 
Asn                     Asp, Gln, His 
Asp                     Asn, Glu 
Cys                     Ala, Ser 
Gln                     Asn, Glu, His 
Glu                     Asp, Gln, His 
Gly                     Ala 
His                     Asn, Arg, Gln, Glu 
Ile                     Leu, Val 
Leu                     Ile, Val 
Lys                     Arg, Gln, Glu 
Met                     Leu, Ile 
Phe                     His, Met, Leu, Trp, Tyr 
Ser                     Cys, Thr 
Thr                     Ser, Val 
Trp                     Phe, Tyr 
Tyr                     His, Phe, Trp 
Val                     Ile, Leu, Thr 

Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
the side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide can include, for example, replacement of hydrogen by an 
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which 
retains at least one biological or immunological function of the natural molecule. A derivative 
polypeptide is one modified by glycosylation, pegylation, or any similar process that retains at least 
one biological or immunological function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

"Differential expression" refers to increased or upregulated; or decreased, downregulated, or 
absent gene or protein expression, determined by comparing at least two different samples. Such 
comparisons may be carried out between, for example, a treated and an untreated sample, or a 
diseased and a normal sample. 
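
A comparison between two samples of this kind is often summarized as a log2 fold change. The sketch below assumes hypothetical expression values and a hypothetical cutoff of a twofold change (`min_log2_fc = 1.0`); neither the cutoff nor the function name comes from the specification.

```python
import math

def classify_expression(treated: float, untreated: float, min_log2_fc: float = 1.0) -> str:
    """Label expression in the treated sample relative to the untreated sample."""
    if treated <= 0 and untreated <= 0:
        return "absent in both"
    if treated <= 0:
        return "absent in treated"
    if untreated <= 0:
        return "absent in untreated"
    log2_fc = math.log2(treated / untreated)
    if log2_fc >= min_log2_fc:
        return "upregulated"
    if log2_fc <= -min_log2_fc:
        return "downregulated"
    return "unchanged"

print(classify_expression(8.0, 2.0))
print(classify_expression(0.0, 5.0))
```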

"Exon shuffling" refers to the recombination of different coding regions (exons). Since an 
exon may represent a structural or functional domain of the encoded protein, new proteins may be 
assembled through the novel reassortment of stable substructures, thus allowing acceleration of the 
evolution of new protein functions. 

A "fragment" is a unique portion of PMMM or a polynucleotide encoding PMMM which can 
be identical in sequence to, but shorter in length than, the parent sequence. A fragment may comprise 
up to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For 
example, a fragment may comprise from about 5 to about 1000 contiguous nucleotides or amino acid 
residues. A fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, 
may be at least 5, 10, 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous 
nucleotides or amino acid residues in length. Fragments may be preferentially selected from certain 
regions of a molecule. For example, a polypeptide fragment may comprise a certain length of 
contiguous amino acids selected from the first 250 or 500 amino acids (or first 25% or 50%) of a 
polypeptide as shown in a certain defined sequence. Clearly these lengths are exemplary, and any 
length that is supported by the specification, including the Sequence Listing, tables, and figures, may 
be encompassed by the present embodiments. 

A fragment of SEQ ID NO:59-116 can comprise a region of unique polynucleotide sequence 
that specifically identifies SEQ ID NO:59-116, for example, as distinct from any other sequence in 
the genome from which the fragment was obtained. A fragment of SEQ ID NO:59-116 can be 
employed in one or more embodiments of methods of the invention, for example, in hybridization and 
amplification technologies and in analogous methods that distinguish SEQ ID NO:59-116 from 
related polynucleotides. The precise length of a fragment of SEQ ID NO:59-116 and the region of 
SEQ ID NO:59-116 to which the fragment corresponds are routinely determinable by one of ordinary 
skill in the art based on the intended purpose for the fragment. 
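
One simple way to look for a region of unique sequence is to list the fixed-length windows of a target that do not occur in any related sequence. The sketch below is an assumption-laden illustration: the toy sequences, the window length `k`, and the function names are hypothetical, and only the given strand is examined.

```python
def kmers(seq: str, k: int) -> set[str]:
    """All contiguous windows of length k in seq."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def identifying_fragments(target: str, others: list[str], k: int) -> list[str]:
    """Windows of length k found in target but in none of the other sequences."""
    background: set[str] = set()
    for other in others:
        background |= kmers(other, k)
    return sorted(kmers(target, k) - background)

# Toy example: which 4-base fragments distinguish the target from one related sequence?
print(identifying_fragments("ACGTAC", ["CGTACC"], k=4))
```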

A fragment of SEQ ID NO:1-58 is encoded by a fragment of SEQ ID NO:59-116. A 
fragment of SEQ ID NO:1-58 can comprise a region of unique amino acid sequence that specifically 
identifies SEQ ID NO:1-58. For example, a fragment of SEQ ID NO:1-58 can be used as an 
immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO:1-58. 
The precise length of a fragment of SEQ ID NO:1-58 and the region of SEQ ID NO:1-58 to which the 
fragment corresponds can be determined based on the intended purpose for the fragment using one or 
more analytical methods described herein or otherwise known in the art. 

A "full length" polynucleotide is one containing at least a translation initiation codon (e.g., 
methionine) followed by an open reading frame and a translation termination codon. A "full length" 
polynucleotide sequence encodes a "full length" polypeptide sequence. 
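
The definition above can be checked mechanically. The following sketch assumes ATG as the initiation codon and the three standard stop codons, and scans only the given strand; the function name and the toy sequence are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_full_length_orf(seq: str):
    """Return (start, end) of the first ATG...stop open reading frame, or None.

    start is the 0-based index of the A in ATG; end is one past the stop codon.
    """
    seq = seq.upper()
    start = seq.find("ATG")
    while start != -1:
        # Walk codon by codon in the frame set by this ATG.
        for i in range(start + 3, len(seq) - 2, 3):
            if seq[i:i + 3] in STOP_CODONS:
                return start, i + 3
        start = seq.find("ATG", start + 1)
    return None

orf = find_full_length_orf("CCATGGCTTAAGG")
print(orf)
```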

"Homology" refers to sequence similarity or, alternatively, sequence identity, between two or 
more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer 
to the percentage of identical nucleotide matches between at least two polynucleotide sequences 
aligned using a standardized algorithm. Such an algorithm may insert, in a standardized and 
reproducible way, gaps in the sequences being compared in order to optimize alignment between two 
sequences, and therefore achieve a more meaningful comparison of the two sequences. 
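
Once two sequences have been aligned, percent identity reduces to counting matching columns. The sketch below assumes the alignment is already given as two equal-length strings with "-" marking gaps, and assumes the denominator is the full alignment length including gap columns; other conventions exist, and the function name is hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of alignment columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)

# One gap column and one mismatch in a nine-column alignment.
print(round(percent_identity("ACGT-ACGT", "ACGTTACCT"), 1))
```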

Percent identity between polynucleotide sequences may be determined using one or more 
computer algorithms or programs known in the art or described herein. For example, percent identity 
can be determined using the default parameters of the CLUSTAL V algorithm as incorporated into 
the MEGALIGN version 3.12e sequence alignment program. This program is part of the 
LASERGENE software package, a suite of molecular biological analysis programs (DNASTAR, 
Madison WI). CLUSTAL V is described in Higgins, D.G. and P.M. Sharp (1989; CABIOS 5:151-153) 
and in Higgins, D.G. et al. (1992; CABIOS 8:189-191). For pairwise alignments of 
polynucleotide sequences, the default parameters are set as follows: Ktuple=2, gap penalty=5, 
window=4, and "diagonals saved"=4. The "weighted" residue weight table is selected as the default. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms 
which can be used is provided by the National Center for Biotechnology Information (NCBI) Basic 
Local Alignment Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), 
which is available from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence 
analysis programs including "blastn," that is used to align a known polynucleotide sequence with 
other polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. 
The "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-21-2000) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 
Reward for match: 1 
Penalty for mismatch: -2 
Open Gap: 5 and Extension Gap: 2 penalties 
Gap X drop-off: 50 
Expect: 10 
Word Size: 11 
Filter: on 
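
To show how the match, mismatch, and gap values listed above combine, the sketch below scores an alignment that has already been made. It assumes a gap of length k costs the open penalty plus k times the extension penalty, which is an assumption about the convention rather than a statement from this specification. It does not perform the BLAST search itself, and the function name is hypothetical.

```python
def alignment_score(aligned_a: str, aligned_b: str, reward: int = 1, penalty: int = -2,
                    gap_open: int = 5, gap_extend: int = 2, gap: str = "-") -> int:
    """Raw score of a gapped nucleotide alignment under affine gap costs."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    score = 0
    gap_side = None  # which sequence the currently open gap is in, if any
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            side = "a" if x == gap else "b"
            # Charge the open penalty only when a new gap starts.
            score -= gap_extend + (gap_open if side != gap_side else 0)
            gap_side = side
        else:
            score += reward if x == y else penalty
            gap_side = None
    return score

# Seven matches, one mismatch, and one single-base gap.
print(alignment_score("ACGT-ACGT", "ACGTTACCT"))
```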

Percent identity may be measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 
nucleotides. Such lengths are exemplary only, and it is understood that any fragment length 
supported by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 
in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 
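
The effect of degeneracy can be illustrated with the standard genetic code. In the sketch below, the two toy DNA sequences are hypothetical; they differ at half of their positions yet translate to the same peptide.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(dna: str) -> str:
    """Translate complete codons of a DNA coding sequence, reading from position 0."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def percent_identical_positions(a: str, b: str) -> float:
    """Ungapped percent identity between two equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    return 100.0 * sum(x == y for x, y in zip(a.upper(), b.upper())) / len(a)

seq_a = "ATGGCTTTATCT"
seq_b = "ATGGCCCTGAGC"
print(translate(seq_a), translate(seq_b))
print(percent_identical_positions(seq_a, seq_b))
```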

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of identical residue matches between at least two polypeptide sequences aligned using 
a standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 
alignment methods take into account conservative amino acid substitutions. Such conservative 
substitutions, explained in more detail above, generally preserve the charge and hydrophobicity at the 
site of substitution, thus preserving the structure (and therefore function) of the polypeptide. The 
phrases "percent similarity" and "% similarity," as applied to polypeptide sequences, refer to the 
percentage of residue matches, including identical residue matches and conservative substitutions, 
between at least two polypeptide sequences aligned using a standardized algorithm. In contrast, 
conservative substitutions are not included in the calculation of percent identity between polypeptide 
sequences. 
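
The difference between percent identity and percent similarity can be sketched using the conservative substitution table given earlier in this section, written here in one-letter codes. Two points are assumptions made for illustration: a pair is counted as conservative if the table lists it in either direction, and the denominator is the full alignment length. The function names are hypothetical.

```python
# Conservative substitutions from the table above (original residue -> substitutes).
CONSERVATIVE = {
    "A": "GS", "R": "HK", "N": "DQH", "D": "NE", "C": "AS", "Q": "NEH",
    "E": "DQH", "G": "A", "H": "NRQE", "I": "LV", "L": "IV", "K": "RQE",
    "M": "LI", "F": "HMLWY", "S": "CT", "T": "SV", "W": "FY", "Y": "HFW",
    "V": "ILT",
}

def is_conservative(x: str, y: str) -> bool:
    """True if the table lists y for x or x for y (symmetric reading, an assumption)."""
    return y in CONSERVATIVE.get(x, "") or x in CONSERVATIVE.get(y, "")

def identity_and_similarity(aligned_a: str, aligned_b: str, gap: str = "-"):
    """Return (percent identity, percent similarity) for a pre-aligned polypeptide pair."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("aligned sequences must be non-empty and of equal length")
    identical = similar = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if gap in (x, y):
            continue
        if x == y:
            identical += 1
        elif is_conservative(x, y):
            similar += 1
    n = len(aligned_a)
    return 100.0 * identical / n, 100.0 * (identical + similar) / n

print(identity_and_similarity("MKLV", "MRIA"))
```

In this toy alignment only the first residue is identical, while Lys/Arg and Leu/Ile count toward similarity and Val/Ala counts toward neither.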

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program (described and referenced above). For pairwise alignments of 
polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=1, gap 
penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0.12 (April-21-2000) with blastp set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62 
Open Gap: 11 and Extension Gap: 1 penalties 
Gap X drop-off: 50 
Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of 
complementarity. Specific hybridization complexes form under permissive annealing conditions and 
remain hybridized after the "washing" step(s). The washing step(s) is particularly important in 
determining the stringency of the hybridization process, with more stringent conditions allowing less 
non-specific binding, i.e., binding between pairs of nucleic acid strands that are not perfectly 
matched. Permissive conditions for annealing of nucleic acid sequences are routinely determinable 
by one of ordinary skill in the art and may be consistent among hybridization experiments, whereas 
wash conditions may be varied among experiments to achieve the desired stringency, and therefore 
hybridization specificity. Permissive annealing conditions occur, for example, at 68°C in the 
presence of about 6 x SSC, about 1% (w/v) SDS, and about 100 μg/ml sheared, denatured salmon 
sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
under which the wash step is carried out. Such wash temperatures are typically selected to be about 
5°C to 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic 
strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating Tm and 
conditions for nucleic acid hybridization are well known and can be found in Sambrook, J. and D.W. 
Russell (2001; Molecular Cloning: A Laboratory Manual, 3rd ed., vol. 1-3, Cold Spring Harbor Press, 
Cold Spring Harbor NY, ch. 9). 
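
As a rough illustration of how a wash temperature might be chosen, the sketch below uses one commonly quoted empirical approximation, Tm = 81.5 + 16.6 log10[Na+] + 0.41(%G+C) - 675/N, where N is the hybrid length in bases. This formula is an assumption for illustration and is not necessarily the equation given in the cited manual. The sodium concentration of 1 x SSC (taken here as 0.195 M, from 0.15 M NaCl plus 0.015 M trisodium citrate) and the function names are likewise assumptions.

```python
import math

NA_MOLAR_IN_1X_SSC = 0.195  # assumed: 0.15 M NaCl + 0.015 M trisodium citrate

def estimate_tm(seq: str, ssc_x: float) -> float:
    """Approximate Tm in degrees C for a probe washed in ssc_x-fold SSC (empirical formula)."""
    seq = seq.upper()
    n = len(seq)
    gc_percent = 100.0 * (seq.count("G") + seq.count("C")) / n
    sodium = NA_MOLAR_IN_1X_SSC * ssc_x
    return 81.5 + 16.6 * math.log10(sodium) + 0.41 * gc_percent - 675.0 / n

def wash_temperature_window(tm: float):
    """Wash temperatures about 20 to 5 degrees C below Tm, as described above."""
    return tm - 20.0, tm - 5.0

probe = "ACGT" * 12 + "AC"  # hypothetical 50-base probe with 50% G+C
tm = estimate_tm(probe, ssc_x=0.2)
print(round(tm, 1), wash_temperature_window(tm))
```

Lowering the SSC concentration lowers the estimated Tm, which is why a low-salt wash at a fixed temperature is the more stringent one.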

High stringency conditions for hybridization between polynucleotides of the present 
invention include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, 
for 1 hour. Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC 
concentration may be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. 
Typically, blocking reagents are used to block non-specific hybridization. Such blocking reagents 
include, for instance, sheared and denatured salmon sperm DNA at about 100-200 μg/ml. Organic 
solvent, such as formamide at a concentration of about 35-50% v/v, may also be used under particular 
circumstances, such as for RNA:DNA hybridizations. Useful variations on these wash conditions 
will be readily apparent to those of ordinary skill in the art. Hybridization, particularly under high 
stringency conditions, may be suggestive of evolutionary similarity between the nucleotides. Such 
similarity is strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic acids by 
virtue of the formation of hydrogen bonds between complementary bases. A hybridization complex 
may be formed in solution (e.g., C0t or R0t analysis) or formed between one nucleic acid present in 
solution and another nucleic acid immobilized on a solid support (e.g., paper, membranes, filters, 
chips, pins or glass slides, or any other appropriate substrate to which cells or their nucleic acids have 
been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or polynucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of PMMM which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment 
of PMMM which is useful in any of the antibody production methods disclosed herein or known in 
the art. 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, 
polypeptides, antibodies, or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, antibody, or 
other chemical compound having a unique and defined position on a microarray. 

The term "modulate" refers to a change in the activity of PMMM. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other 
biological, functional, or immunological properties of PMMM. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of a PMMM may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in 
the art. These processes may occur synthetically or biochemically. Biochemical modifications will 
vary by cell type depending on the enzymatic milieu of PMMM. 

"Probe" refers to nucleic acids encoding PMMM, their complements, or fragments thereof, 
which are used to detect identical, allelic or related nucleic acids. Probes are isolated 
oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical 
labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are 
short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide 
by complementary base-pairing. The primer may then be extended along the target DNA strand by a 
DNA polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic 
acid, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in, for example, 
Sambrook, J. and D.W. Russell (2001; Molecular Cloning: A Laboratory Manual, 3rd ed., vol. 1-3, 
Cold Spring Harbor Press, Cold Spring Harbor NY), Ausubel, F.M. et al. (1999; Short Protocols in 
Molecular Biology, 4th ed., John Wiley & Sons, New York NY), and Innis, M. et al. (1990; PCR 
Protocols, A Guide to Methods and Applications, Academic Press, San Diego CA). PCR primer pairs 
can be derived from a known sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 
MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 
5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer 
selection programs have incorporated additional features for expanded capabilities. For example, the 
PrimOU primer selection program (available to the public from the Genome Center at University of 
Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from 
megabase sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 
primer selection program (available to the public from the Whitehead Institute/MIT Center for 
Genome Research, Cambridge MA) allows the user to input a "mispriming library," in which 
sequences to avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the 
selection of oligonucleotides for microarrays. (The source code for the latter two primer selection 
programs may also be obtained from their respective sources and modified to meet the user's specific 
needs.) The PrimeGen program (available to the public from the UK Human Genome Mapping 
Project Resource Centre, Cambridge UK) designs primers based on multiple sequence alignments, 
thereby allowing selection of primers that hybridize to either the most conserved or least conserved 
regions of aligned nucleic acid sequences. Hence, this program is useful for identification of both 
unique and conserved oligonucleotides and polynucleotide fragments. The oligonucleotides and 
polynucleotide fragments identified by any of the above selection methods are useful in hybridization 
technologies, for example, as PCR or sequencing primers, microarray elements, or specific probes to 
identify fully or partially complementary polynucleotides in a sample of nucleic acids. Methods of 
oligonucleotide selection are not limited to those described above. 
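
The idea of screening candidate primers against user-specified sequences to avoid can be sketched as follows. This is a simplified illustration and not the algorithm of any of the programs named above; the function names, the exact-substring test, and the toy template are all assumptions, and the six-base window is far shorter than a practical primer.

```python
def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written with A, C, G, T."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]

def candidate_primers(template: str, length: int, avoid: list[str]):
    """List (position, primer) windows that share no exact overlap with avoided sequences.

    A window is rejected if it contains, or is contained in, any avoided
    sequence or the reverse complement of one.
    """
    template = template.upper()
    blocked = [s.upper() for s in avoid] + [reverse_complement(s) for s in avoid]
    kept = []
    for i in range(len(template) - length + 1):
        primer = template[i:i + length]
        if not any(primer in b or b in primer for b in blocked):
            kept.append((i, primer))
    return kept

for position, primer in candidate_primers("AAACCCGGGTTT", 6, avoid=["CCGG"]):
    print(position, primer)
```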

A "recombinant nucleic acid" is a nucleic acid that is not naturally occurring or has a 
sequence that is made by an artificial combination of two or more otherwise separated segments of 
sequence. This artificial combination is often accomplished by chemical synthesis or, more 
commonly, by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic 
engineering techniques such as those described in Sambrook and Russell (supra). The term 
recombinant includes nucleic acids that have been altered solely by addition, substitution, or deletion 
of a portion of the nucleic acid. Frequently, a recombinant nucleic acid may include a nucleic acid 
sequence operably linked to a promoter sequence. Such a recombinant nucleic acid may be part of a 
vector that is used, for example, to transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be used to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions 
(UTRs). Regulatory elements interact with host or viral proteins which control transcription, 
translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art. 

An "RNA equivalent," in reference to a DNA molecule, is composed of the same linear 
sequence of nucleotides as the reference DNA molecule with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 
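
At the level of sequence notation, the RNA equivalent is obtained by a single substitution. The sketch below handles only the written sequence (the change of sugar backbone has no textual counterpart), and the function name is hypothetical.

```python
def rna_equivalent(dna: str) -> str:
    """Same linear sequence as the DNA, with every thymine (T) written as uracil (U)."""
    return dna.upper().replace("T", "U")

print(rna_equivalent("ATGCTT"))
```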

The term "sample" is used in its broadest sense. A sample suspected of containing PMMM, 
nucleic acids encoding PMMM, or fragments thereof may comprise a bodily fluid; an extract from a 
cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or 
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular 
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide 
comprising the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A 
and the antibody will reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least about 60% free, 
preferably at least about 75% free, and most preferably at least about 90% free from other 
components with which they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 
by different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" or "expression profile" refers to the collective pattern of gene 
expression by a particular cell type or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods 
well known in the art, and may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based 
on the type of host cell being transformed and may include, but is not limited to, bacteriophage or 
viral infection, electroporation, heat shock, lipofection, and particle bombardment. The term 
"transformed cells" includes stably transformed cells in which the inserted DNA is capable of 
replication either as an autonomously replicating plasmid or as part of the host chromosome, as well 
as transiently transformed cells which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to 
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor 
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 
a recombinant virus. In another embodiment, the nucleic acid can be introduced by infection with a 
recombinant viral vector, such as a lentiviral vector (Lois, C. et al. (2002) Science 295:868-872). The 
term genetic manipulation does not include classical cross-breeding, or in vitro fertilization, but 
rather is directed to the introduction of a recombinant DNA molecule. The transgenic organisms 
contemplated in accordance with the present invention include bacteria, cyanobacteria, fungi, plants 
and animals. The isolated DNA of the present invention can be introduced into the host by methods 
known in the art, for example infection, transfection, transformation or transconjugation. Techniques 
for transferring the DNA of the present invention into such organisms are widely known and 
provided in references such as Sambrook and Russell (supra). 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 
(May-07-1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 
93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater 
sequence identity over a certain defined length. A variant may be described as, for example, an 
"allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have 
significant identity to a reference molecule, but will generally have a greater or lesser number of 
polynucleotides due to alternate splicing during mRNA processing. The corresponding polypeptide 
may possess additional functional domains or lack domains that are present in the reference molecule. 
Species variants are polynucleotides that vary from one species to another. The resulting 
polypeptides will generally have significant amino acid identity relative to each other. A 
polymorphic variant is a variation in the polynucleotide sequence of a particular gene between 
individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The 
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a 
propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
at least 40% sequence identity or sequence similarity to the particular polypeptide sequence over a 
certain length of one of the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool 
Version 2.0.9 (May-07-1999) set at default parameters. Such a pair of polypeptides may show, for 
example, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, 
or at least 99% or greater sequence identity or sequence similarity over a certain defined length of one 
of the polypeptides. 
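
The identity thresholds in the two "variant" definitions above can be illustrated with a short calculation. The following Python sketch is not the BLAST 2 Sequences tool and does not reproduce its scoring. It assumes that a pairwise alignment has already been produced, with gaps written as "-", and simply counts identical positions over the aligned length. The function names, the gap convention, and the choice of denominator are assumptions made for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns in which both sequences carry the same residue.

    Assumes the two strings are rows of an existing pairwise alignment
    (equal length, '-' marks a gap). Columns that are gaps in both rows
    are ignored; a gap opposite a residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment contains no residues")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_variant_threshold(aligned_a: str, aligned_b: str,
                            threshold: float = 40.0) -> bool:
    """True when the identity reaches the stated threshold (40% by default)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For example, two aligned ten-residue sequences that differ at one position give 90% identity, which satisfies the 40% floor as well as the higher thresholds up to 90%.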

THE INVENTION 

Various embodiments of the invention include new human protein modification and 
maintenance molecules (PMMM), the polynucleotides encoding PMMM, and the use of these 
compositions for the diagnosis, treatment, or prevention of gastrointestinal, cardiovascular, 
autoimmune/inflammatory, cell proliferative, developmental, epithelial, neurological, and 
reproductive disorders. 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
embodiments of the invention. Each polynucleotide and its corresponding polypeptide are correlated 
to a single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is 
denoted by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an 
Incyte polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide 
sequence is denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ 
ID NO:) and an Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as 
shown. Column 6 shows the Incyte ID numbers of physical, full length clones corresponding to the 
polypeptide and polynucleotide sequences of the invention. The full length clones encode 
polypeptides which have at least 95% sequence identity to the polypeptide sequences shown in 
column 3. 

Table 2 shows sequences with homology to polypeptide embodiments of the invention as 
identified by BLAST analysis against the GenBank protein (genpept) database and the PROTEOME 
database. Columns 1 and 2 show the polypeptide sequence identification number (Polypeptide SEQ 
ID NO:) and the corresponding Incyte polypeptide sequence number (Incyte Polypeptide ID) for 
polypeptides of the invention. Column 3 shows the GenBank identification number (GenBank ID 
NO:) of the nearest GenBank homolog and the PROTEOME database identification numbers 
(PROTEOME ID NO:) of the nearest PROTEOME database homologs. Column 4 shows the 
probability scores for the matches between each polypeptide and its homolog(s). Column 5 shows the 
annotation of the GenBank and PROTEOME database homolog(s) along with relevant citations 
where applicable, all of which are expressly incorporated by reference herein. 

Table 3 shows various structural features of the polypeptides of the invention. Columns 1 
and 2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding 
Incyte polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention. 
Column 3 shows the number of amino acid residues in each polypeptide. Column 4 shows potential 
phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the 
MOTIFS program of the GCG sequence analysis software package (Accelrys, Burlington MA). 
Column 6 shows amino acid residues comprising signature sequences, domains, and motifs. Column 
7 shows analytical methods for protein structure/function analysis and in some cases, searchable 
databases to which the analytical methods were applied. 

Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and these 
properties establish that the claimed polypeptides are protein modification and maintenance 
molecules. For example, SEQ ID NO:3 is 100% identical, from residue M1 to residue K696, to 
human ubiquitin-activating enzyme E1-like protein (GenBank ID g13623539) as determined by the 
Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is 0, 
which indicates the probability of obtaining the observed polypeptide sequence alignment by chance. 
SEQ ID NO:3 has homology to proteins that are ubiquitin-activating enzyme E1 type proteins, as 
determined by BLAST analysis using the PROTEOME database. SEQ ID NO:3 also contains a ThiF 
and a ubiquitin-activating (UBA) protein repeat domain as determined by searching for statistically 
significant matches in the hidden Markov model (HMM)-based PFAM database of conserved protein 
family domains. (See Table 3.) Data from BLIMPS, MOTIFS, BLAST and PROFILESCAN 
analyses provide further corroborative evidence that SEQ ID NO:3 is a ubiquitin-activating enzyme. 
In an alternative example, SEQ ID NO:14 is 100% identical, from residue E162 to residue L345 and 
is 97% identical, from residue M1 to residue D165, to human neuroserpin (GenBank ID g1785654) as 
determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST 
probability scores are 5.5E-176, which indicate the probabilities of obtaining the observed 
polypeptide sequence alignments by chance. As determined by BLAST analysis using the 
PROTEOME database, SEQ ID NO:14 has homology to human and murine neuroserpin, members of 
the serpin family of serine protease inhibitors. Murine neuroserpin may attenuate extracellular 
proteolysis associated with neuronal migration, axogenesis, or synaptic connection formation during 
development. SEQ ID NO:14 also contains serpin and serine protease inhibitor domains as 
determined by searching for statistically significant matches in the hidden Markov model 
(HMM)-based PFAM database of conserved protein family domains. (See Table 3.) Data from BLIMPS, 
MOTIFS, PROFILESCAN, and additional BLAST analyses provide further corroborative evidence 
that SEQ ID NO:14 is a neuroserpin. In an alternative example, SEQ ID NO:28 is 99% identical, 
from residue M1 to residue W124, and 93% identical, from residue C101 to residue K444, to human 
coagulation factor X (GenBank ID g182390, from M1 to F124 and from C140 to K488, respectively) 
as determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST 
probability score is 1.6e-246, which indicates the probability of obtaining the observed polypeptide 
sequence alignment by chance. SEQ ID NO:28 also has homology to proteins that are extracellular, 
have vitamin K-dependent serine protease activity, and are coagulation factor X proteins, as 
determined by BLAST analysis using the PROTEOME database. SEQ ID NO:28 also contains 
trypsin, vitamin K-dependent carboxylation and EGF-like domains as determined by searching for 
statistically significant matches in the hidden Markov model (HMM)-based PFAM database of 
conserved protein families/domains. (See Table 3.) Data from BLIMPS, MOTIFS, BLAST and 
PROFILESCAN analyses provide further corroborative evidence that SEQ ID NO:28 is a coagulation 
factor X protein. SEQ ID NO:1-2, SEQ ID NO:4-13, SEQ ID NO:15-27, and SEQ ID NO:29-58 were 
analyzed and annotated in a similar manner. The algorithms and parameters for the analysis of SEQ 
ID NO:1-58 are described in Table 7. 

As shown in Table 4, the full length polynucleotide embodiments were assembled using 
cDNA sequences or coding (exon) sequences derived from genomic DNA, or any combination of 
these two types of sequences. Column 1 lists the polynucleotide sequence identification number 
(Polynucleotide SEQ ID NO:), the corresponding Incyte polynucleotide consensus sequence number 
(Incyte ID) for each polynucleotide of the invention, and the length of each polynucleotide sequence 
in basepairs. Column 2 shows the nucleotide start (5') and stop (3') positions of the cDNA and/or 
genomic sequences used to assemble the full length polynucleotide embodiments, and of fragments of 
the polynucleotides which are useful, for example, in hybridization or amplification technologies that 
identify SEQ ID NO:59-116 or that distinguish between SEQ ID NO:59-116 and related 
polynucleotides. 

The polynucleotide fragments described in Column 2 of Table 4 may refer specifically, for 
example, to Incyte cDNAs derived from tissue-specific cDNA libraries or from pooled cDNA 
libraries. Alternatively, the polynucleotide fragments described in column 2 may refer to GenBank 
cDNAs or ESTs which contributed to the assembly of the full length polynucleotides. In addition, the 
polynucleotide fragments described in column 2 may identify sequences derived from the ENSEMBL 
(The Sanger Centre, Cambridge, UK) database (i.e., those sequences including the designation 
"ENST"). Alternatively, the polynucleotide fragments described in column 2 may be derived from 
the NCBI RefSeq Nucleotide Sequence Records Database (i.e., those sequences including the 
designation "NM" or "NT") or the NCBI RefSeq Protein Sequence Records (i.e., those sequences 
including the designation "NP"). Alternatively, the polynucleotide fragments described in column 2 
may refer to assemblages of both cDNA and Genscan-predicted exons brought together by an "exon 
stitching" algorithm. For example, a polynucleotide sequence identified as 
FL_XXXXXX_N1_N2_YYYYY_N3_N4 represents a "stitched" sequence in which XXXXXX is the 
identification number of the cluster of sequences to which the algorithm was applied, and YYYYY is 
the number of the prediction generated by the algorithm, and N1,2,3..., if present, represent specific 
exons that may have been manually edited during analysis (See Example V). Alternatively, the 
polynucleotide fragments in column 2 may refer to assemblages of exons brought together by an 
"exon-stretching" algorithm. For example, a polynucleotide sequence identified as 
FLXXXXXX_gAAAAA_gBBBBB_1_N is a "stretched" sequence, with XXXXXX being the Incyte 
project identification number, gAAAAA being the GenBank identification number of the human 
genomic sequence to which the "exon-stretching" algorithm was applied, gBBBBB being the 
GenBank identification number or NCBI RefSeq identification number of the nearest GenBank 
protein homolog, and N referring to specific exons (See Example V). In instances where a RefSeq 
sequence was used as a protein homolog for the "exon-stretching" algorithm, a RefSeq identifier 
(denoted by "NM," "NP," or "NT") may be used in place of the GenBank identifier (i.e., gBBBBB). 

Alternatively, a prefix identifies component sequences that were hand-edited, predicted from 
genomic DNA sequences, or derived from a combination of sequence analysis methods. The 
following Table lists examples of component sequence prefixes and corresponding sequence analysis 
methods associated with the prefixes (see Example IV and Example V). 



Prefix            Type of analysis and/or examples of programs

GNN, GFG, ENST    Exon prediction from genomic sequences using, for example, 
                  GENSCAN (Stanford University, CA, USA) or FGENES 
                  (Computer Genomics Group, The Sanger Centre, Cambridge, UK). 

GBI               Hand-edited analysis of genomic sequences. 

FL                Stitched or stretched genomic sequences (see Example V). 

INCY              Full length transcript and exon prediction from mapping of EST 
                  sequences to the genome. Genomic location and EST composition 
                  data are combined to predict the exons and resulting transcript. 
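
The prefix convention in the preceding table lends itself to a simple lookup. The Python sketch below is an illustration only: the dictionary transcribes the table, while the function name and the example identifiers used with it are hypothetical and are not taken from Table 4.

```python
from typing import Optional

# Prefix-to-analysis mapping transcribed from the table above.
PREFIX_ANALYSIS = {
    "GNN": "Exon prediction from genomic sequences",
    "GFG": "Exon prediction from genomic sequences",
    "ENST": "Exon prediction from genomic sequences",
    "GBI": "Hand-edited analysis of genomic sequences",
    "FL": "Stitched or stretched genomic sequences",
    "INCY": "Full length transcript and exon prediction from mapping of "
            "EST sequences to the genome",
}


def analysis_for(identifier: str) -> Optional[str]:
    """Return the analysis type implied by an identifier's prefix, if any.

    Longer prefixes are tried first so that a short prefix can never
    shadow a longer one that begins with the same letters.
    """
    for prefix in sorted(PREFIX_ANALYSIS, key=len, reverse=True):
        if identifier.startswith(prefix):
            return PREFIX_ANALYSIS[prefix]
    return None
```

An identifier that carries none of the listed prefixes returns None, which leaves its interpretation to the other conventions described for column 2 of Table 4.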



In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in Table 
4 was obtained to confirm the final consensus polynucleotide sequence, but the relevant Incyte cDNA 
identification numbers are not shown. 

Table 5 shows the representative cDNA libraries for those full length polynucleotides which 
were assembled using Incyte cDNA sequences. The representative cDNA library is the Incyte cDNA 
library which is most frequently represented by the Incyte cDNA sequences which were used to 
assemble and confirm the above polynucleotides. The tissues and vectors which were used to 
construct the cDNA libraries shown in Table 5 are described in Table 6. 

Table 8 shows single nucleotide polymorphisms (SNPs) found in polynucleotide sequences of 
the invention, along with allele frequencies in different human populations. Columns 1 and 2 show 
the polynucleotide sequence identification number (SEQ ID NO:) and the corresponding Incyte 
project identification number (PID) for polynucleotides of the invention. Column 3 shows the Incyte 
identification number for the EST in which the SNP was detected (EST ID), and column 4 shows the 
identification number for the SNP (SNP ID). Column 5 shows the position within the EST sequence 
at which the SNP is located (EST SNP), and column 6 shows the position of the SNP within the 
full-length polynucleotide sequence (CB1 SNP). Column 7 shows the allele found in the EST sequence. 
Columns 8 and 9 show the two alleles found at the SNP site. Column 10 shows the amino acid 
encoded by the codon including the SNP site, based upon the allele found in the EST. Columns 11-14 
show the frequency of allele 1 in four different human populations. An entry of n/d (not detected) 
indicates that the frequency of allele 1 in the population was too low to be detected, while n/a (not 
available) indicates that the allele frequency was not determined for the population. 

The invention also encompasses PMMM variants. Various embodiments of PMMM variants 
can have at least about 80%, at least about 90%, or at least about 95% amino acid sequence identity to 
the PMMM amino acid sequence, and can contain at least one functional or structural characteristic 
of PMMM. 

Various embodiments also encompass polynucleotides which encode PMMM. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO:59-116, which encodes PMMM. The polynucleotide 
sequences of SEQ ID NO:59-116, as presented in the Sequence Listing, embrace the equivalent RNA 
sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the 
sugar backbone is composed of ribose instead of deoxyribose. 
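
The base substitution that relates a DNA sequence listing to its equivalent RNA sequence can be written out directly. The following Python sketch is illustrative only; the function name and the accepted alphabet (A, C, G, T and the ambiguity code N) are assumptions. The change of sugar backbone from deoxyribose to ribose is a chemical difference that a text representation does not capture.

```python
def dna_to_rna(dna: str) -> str:
    """Replace every thymine (T) with uracil (U), preserving letter case."""
    unexpected = set(dna) - set("ACGTNacgtn")
    if unexpected:
        raise ValueError(f"unexpected characters in DNA sequence: {sorted(unexpected)}")
    return dna.replace("T", "U").replace("t", "u")
```

Applied to the hypothetical fragment ATGCTT, the function gives AUGCUU.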

The invention also encompasses variants of a polynucleotide encoding PMMM. In particular, 
such a variant polynucleotide will have at least about 70%, or alternatively at least about 85%, or 
even at least about 95% polynucleotide sequence identity to a polynucleotide encoding PMMM. A 
particular aspect of the invention encompasses a variant of a polynucleotide comprising a sequence 
selected from the group consisting of SEQ ID NO:59-116 which has at least about 70%, or 
alternatively at least about 85%, or even at least about 95% polynucleotide sequence identity to a 
nucleic acid sequence selected from the group consisting of SEQ ID NO:59-116. Any one of the 
polynucleotide variants described above can encode a polypeptide which contains at least one 
functional or structural characteristic of PMMM. 

In addition, or in the alternative, a polynucleotide variant of the invention is a splice variant 
of a polynucleotide encoding PMMM. A splice variant may have portions which have significant 
sequence identity to a polynucleotide encoding PMMM, but will generally have a greater or lesser 
number of polynucleotides due to additions or deletions of blocks of sequence arising from alternate 
splicing during mRNA processing. A splice variant may have less than about 70%, or alternatively 
less than about 60%, or alternatively less than about 50% polynucleotide sequence identity to a 
polynucleotide encoding PMMM over its entire length; however, portions of the splice variant will 
have at least about 70%, or alternatively at least about 85%, or alternatively at least about 95%, or 
alternatively 100% polynucleotide sequence identity to portions of the polynucleotide encoding 
PMMM. For example, a polynucleotide comprising a sequence of SEQ ID NO:76, a polynucleotide 
comprising a sequence of SEQ ID NO:77, a polynucleotide comprising a sequence of SEQ ID NO:78, 
a polynucleotide comprising a sequence of SEQ ID NO:89, a polynucleotide comprising a sequence 
of SEQ ID NO:90, and a polynucleotide comprising a sequence of SEQ ID NO:104 are splice variants 
of each other; a polynucleotide comprising a sequence of SEQ ID NO:81 and a polynucleotide 
comprising a sequence of SEQ ID NO:82 are splice variants of each other; a polynucleotide 
comprising a sequence of SEQ ID NO:86, a polynucleotide comprising a sequence of SEQ ID NO:87, 
and a polynucleotide comprising a sequence of SEQ ID NO:88 are splice variants of each other; a 
polynucleotide comprising a sequence of SEQ ID NO:98 and a polynucleotide comprising a sequence 
of SEQ ID NO:115 are splice variants of each other; a polynucleotide comprising a sequence of SEQ 
ID NO:101, a polynucleotide comprising a sequence of SEQ ID NO:103, and a polynucleotide 
comprising a sequence of SEQ ID NO:105 are splice variants of each other; a polynucleotide 
comprising a sequence of SEQ ID NO:108, a polynucleotide comprising a sequence of SEQ ID 
NO:109, and a polynucleotide comprising a sequence of SEQ ID NO:110 are splice variants of each 
other; and a polynucleotide comprising a sequence of SEQ ID NO:112 and a polynucleotide 
comprising a sequence of SEQ ID NO:113 are splice variants of each other. Any one of the splice 
variants described above can encode a polypeptide which contains at least one functional or structural 
characteristic of PMMM. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding PMMM, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring PMMM, and all such variations are to be considered 
as being specifically disclosed. 
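
The size of the "multitude" that follows from the degeneracy of the genetic code can be estimated by multiplying, residue by residue, the number of codons that specify each amino acid under the standard triplet code. The Python sketch below illustrates that arithmetic; the function name is an assumption, the short peptides used with it are hypothetical, and the stop codon is left out of the count.

```python
from math import prod

# Number of sense codons per amino acid in the standard genetic code
# (one-letter amino acid codes; the 20 counts sum to 61).
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_encodings(protein: str) -> int:
    """Number of distinct coding sequences that translate to `protein`."""
    residues = protein.upper()
    unknown = set(residues) - set(CODONS_PER_RESIDUE)
    if unknown:
        raise ValueError(f"unrecognized residues: {sorted(unknown)}")
    return prod(CODONS_PER_RESIDUE[aa] for aa in residues)
```

A three-residue peptide of leucine, serine, and arginine already has 6 x 6 x 6 = 216 possible coding sequences, so the count for a full length polypeptide grows very quickly with length.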

Although polynucleotides which encode PMMM and its variants are generally capable of 
hybridizing to polynucleotides encoding naturally occurring PMMM under appropriately selected 
conditions of stringency, it may be advantageous to produce polynucleotides encoding PMMM or its 
derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring 
codons. Codons may be selected to increase the rate at which expression of the peptide occurs in a 
particular prokaryotic or eukaryotic host in accordance with the frequency with which particular 
codons are utilized by the host. Other reasons for substantially altering the nucleotide sequence 
encoding PMMM and its derivatives without altering the encoded amino acid sequences include the 
production of RNA transcripts having more desirable properties, such as a greater half-life, than 
transcripts produced from the naturally occurring sequence. 
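
Selecting codons according to host preference, as described above, amounts to re-encoding an unchanged amino acid sequence with one chosen codon per residue. The Python sketch below shows the idea under stated assumptions: the caller supplies the preference table, and the small table included here is hypothetical, chosen only so that the example runs. It is not measured codon usage data for any host.

```python
from typing import Dict


def recode(protein: str, preferred_codon: Dict[str, str]) -> str:
    """Encode `protein` using one caller-chosen codon per amino acid.

    The amino acid sequence is unchanged; only the choice among
    synonymous codons differs from one preference table to another.
    """
    codons = []
    for residue in protein.upper():
        if residue not in preferred_codon:
            raise KeyError(f"no preferred codon supplied for residue {residue!r}")
        codons.append(preferred_codon[residue])
    return "".join(codons)


# Hypothetical preference table for illustration only (not real host data).
# Each entry is a valid standard-code codon for the residue it is keyed by.
EXAMPLE_PREFERENCES = {"M": "ATG", "K": "AAA", "L": "CTG", "S": "AGC"}
```

With this table, the hypothetical peptide MKLS is encoded as ATGAAACTGAGC. A real application would take the table from codon usage frequencies observed in the intended prokaryotic or eukaryotic host.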

The invention also encompasses production of polynucleotides which encode PMMM and 
PMMM derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic polynucleotide may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a polynucleotide encoding PMMM or any fragment thereof. 

Embodiments of the invention can also include polynucleotides that are capable of 
hybridizing to the claimed polynucleotides, and, in particular, to those having the sequences shown in 
SEQ ID NO:59-116 and fragments thereof, under various conditions of stringency (Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 
152:507-511). Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 
Biosystems), thermostable T7 polymerase (Amersham Biosciences, Piscataway NJ), or combinations 
of polymerases and proofreading exonucleases such as those found in the ELONGASE amplification 
system (Invitrogen, Carlsbad CA). Preferably, sequence preparation is automated with machines such 
as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), PTC200 thermal cycler (MJ 
Research, Watertown MA) and ABI CATALYST 800 thermal cycler (Applied Biosystems). 
Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing system (Applied 
Biosystems), the MEGABACE 1000 DNA sequencing system (Amersham Biosciences), or other 
systems known in the art. The resulting sequences are analyzed using a variety of algorithms which 
are well known in the art (Ausubel et al., supra, ch. 7; Meyers, R.A. (1995) Molecular Biology and 
Biotechnology, Wiley VCH, New York NY, pp. 856-853). 

The nucleic acids encoding PMMM may be extended utilizing a partial nucleotide sequence 
and employing various PCR-based methods known in the art to detect upstream sequences, such as 
promoters and regulatory elements. For example, one method which may be employed, 
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector (Sarkar, G. (1993) PCR Methods Applic. 2:318-322). Another method, 
inverse PCR, uses primers that extend in divergent directions to amplify unknown sequence from a 
circularized template. The template is derived from restriction fragments comprising a known 
genomic locus and surrounding sequences (Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186). A 
third method, capture PCR, involves PCR amplification of DNA fragments adjacent to known 
sequences in human and yeast artificial chromosome DNA (Lagerstrom, M. et al. (1991) PCR 
Methods Applic. 1:111-119). In this method, multiple restriction enzyme digestions and ligations 
may be used to insert an engineered double-stranded sequence into a region of unknown sequence 
before performing PCR. Other methods which may be used to retrieve unknown sequences are 
known in the art (Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). Additionally, one may 
use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo Alto CA) to walk 
genomic DNA. This procedure avoids the need to screen libraries and is useful in finding intron/exon 
junctions. For all PCR-based methods, primers may be designed using commercially available 
software, such as OLIGO 4.06 primer analysis software (National Biosciences, Plymouth MN) or 
another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC content of 
about 50% or more, and to anneal to the template at temperatures of about 68°C to 72°C. 
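
Two of the primer design criteria stated above, length and GC content, can be checked with a few lines of code. The Python sketch below is not the OLIGO software and does not model annealing temperature; the function names are assumptions, the example primers are hypothetical, and the limits are treated as hard cutoffs even though the text gives them as approximate values.

```python
def gc_content(primer: str) -> float:
    """Percentage of G and C bases in the primer."""
    seq = primer.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("primer must be a non-empty string of A, C, G, T")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def meets_length_and_gc(primer: str, min_len: int = 22, max_len: int = 30,
                        min_gc: float = 50.0) -> bool:
    """True if the primer is 22 to 30 nt long and at least 50% GC.

    Annealing temperature (about 68 to 72 degrees C in the text) is not
    evaluated here; it would need a separate melting temperature model.
    """
    return min_len <= len(primer) <= max_len and gc_content(primer) >= min_gc
```

A hypothetical 22-mer with twelve G or C bases passes both checks, while a ten-base stretch of alternating A and T fails on both length and GC content.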

When screening for full length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 
into 5' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze 
the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotides or fragments thereof which encode 
PMMM may be cloned in recombinant DNA molecules that direct expression of PMMM, or 
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy 
of the genetic code, other polynucleotides which encode substantially the same or a functionally 
equivalent polypeptide may be produced and used to express PMMM. 

The polynucleotides of the invention can be engineered using methods generally known in 
the art in order to alter PMMM-encoding sequences for a variety of purposes including, but not 
limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagenesis may be used to introduce mutations that create new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent No. 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 
improve the biological properties of PMMM, such as its biological or enzymatic activity or its ability 
to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 
properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point mutations may be recombined, screened, and then reshuffled until the desired properties are 
optimized. Alternatively, fragments of a given gene may be recombined with fragments of 
homologous genes in the same gene family, either from the same or different species, thereby 
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 
manner. 

In another embodiment, polynucleotides encoding PMMM may be synthesized, in whole or in 
part, using one or more chemical methods well known in the art (Caruthers, M.H. et al. (1980) 
Nucleic Acids Symp. Ser. 7:215-223; Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232). 
Alternatively, PMMM itself or a fragment thereof may be synthesized using chemical methods known 
in the art. For example, peptide synthesis can be performed using various solution-phase or 
solid-phase techniques (Creighton, T. (1984) Proteins, Structures and Molecular Properties, WH 
Freeman, New York NY, pp. 55-60; Roberge, J.Y. et al. (1995) Science 269:202-204). Automated 
synthesis may be achieved using the ABI 431A peptide synthesizer (Applied Biosystems). 
Additionally, the amino acid sequence of PMMM, or any part thereof, may be altered during direct 
synthesis and/or combined with sequences from other proteins, or any part thereof, to produce a 
variant polypeptide or a polypeptide having a sequence of a naturally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography (Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421). The 
composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing 
(Creighton, supra, pp. 28-53). 

In order to express a biologically active PMMM, the polynucleotides encoding PMMM or 
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 
the necessary elements for transcriptional and translational control of the inserted coding sequence in 
a suitable host. These elements include regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5' and 3' untranslated regions in the vector and in polynucleotides encoding 
PMMM. Such elements may vary in their strength and specificity. Specific initiation signals may 
also be used to achieve more efficient translation of polynucleotides encoding PMMM. Such signals 
include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where a 
polynucleotide sequence encoding PMMM and its initiation codon and upstream regulatory 
sequences are inserted into the appropriate expression vector, no additional transcriptional or 
translational control signals may be needed. However, in cases where only coding sequence, or a 
fragment thereof, is inserted, exogenous translational control signals including an in-frame ATG 
initiation codon should be provided by the vector. Exogenous translational elements and initiation 
codons may be of various origins, both natural and synthetic. The efficiency of expression may be 
enhanced by the inclusion of enhancers appropriate for the particular host cell system used (Scharf,
D. et al. (1994) Results Probl. Cell Differ. 20:125-162). 
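
By way of non-limiting illustration, the requirement for an in-frame ATG initiation codon and a favorable adjacent sequence can be sketched in Python. The function names, the simplified Kozak test (a purine at position -3 and a G at position +4, counting the A of ATG as +1), and the short example sequence are assumptions made for this sketch and do not appear in the disclosure.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def has_strong_kozak(seq, atg_index):
    """Simplified Kozak test: purine at -3 and G at +4 (A of ATG is +1)."""
    if atg_index < 3 or atg_index + 3 >= len(seq):
        return False  # not enough flanking sequence to evaluate the context
    return seq[atg_index - 3] in "AG" and seq[atg_index + 3] == "G"


def find_initiation_codons(seq):
    """List (index, strong_kozak, codons_before_stop) for each ATG that is
    followed by an in-frame stop codon."""
    seq = seq.upper()
    results = []
    for i in range(len(seq) - 2):
        if seq[i:i + 3] != "ATG":
            continue
        # Walk codon by codon in the frame set by this ATG.
        for j in range(i + 3, len(seq) - 2, 3):
            if seq[j:j + 3] in STOP_CODONS:
                results.append((i, has_strong_kozak(seq, i), (j - i) // 3))
                break
    return results


if __name__ == "__main__":
    # Hypothetical insert: Kozak-like context, ATG, one codon, stop.
    print(find_initiation_codons("GCCACCATGGCTTAA"))
```

A coding fragment that lacks such an ATG in the reading frame of interest would return no entry for that frame, which corresponds to the case in which the vector should supply the initiation codon.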

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing polynucleotides encoding PMMM and appropriate transcriptional and translational 
control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, 
and in vivo genetic recombination (Sambrook and Russell, supra, ch. 1-4, and 8; Ausubel et al., 
supra, ch. 1, 3, and 15). 

A variety of expression vector/host systems may be utilized to contain and express 
polynucleotides encoding PMMM. These include, but are not limited to, microorganisms such as
bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; 
yeast transformed with yeast expression vectors; insect cell systems infected with viral expression 
vectors (e.g., baculovirus); plant cell systems transformed with viral expression vectors (e.g., 
cauliflower mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors 
(e.g., Ti or pBR322 plasmids); or animal cell systems (Sambrook and Russell, supra; Ausubel et al., 
supra; Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et al. 
(1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937- 
1945; Takamatsu, N. (1987) EMBO J. 6:307-311; The McGraw Hill Yearbook of Science and
Technology (1992) McGraw Hill, New York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc.
Natl. Acad. Sci. USA 81:3655-3659; Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355).
Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from 
various bacterial plasmids, may be used for delivery of polynucleotides to the targeted organ, tissue, 
or cell population (Di Nicola, M. et al. (1998) Cancer Gen. Ther. 5:350-356; Yu, M. et al. (1993) 
Proc. Natl. Acad. Sci. USA 90:6340-6344; Buller, R.M. et al. (1985) Nature 317:813-815; McGregor,
D.P. et al. (1994) Mol. Immunol. 31:219-226; Verma, I.M. and N. Somia (1997) Nature 389:239- 
242). The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotides encoding PMMM. For example, routine cloning, 
subcloning, and propagation of polynucleotides encoding PMMM can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1
plasmid (Invitrogen). Ligation of polynucleotides encoding PMMM into the vector's multiple 
cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of 
transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 
in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 
nested deletions in the cloned sequence (Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 
264:5503-5509). When large quantities of PMMM are needed, e.g. for the production of antibodies, 
vectors which direct high level expression of PMMM may be used. For example, vectors containing 
the strong, inducible SP6 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of PMMM. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable 
integration of foreign polynucleotide sequences into the host genome for stable propagation (Ausubel 
et al., supra; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; Scorer, C.A. et al. (1994)
Bio/Technology 12:181-184). 

Plant systems may also be used for expression of PMMM. Transcription of polynucleotides 
encoding PMMM may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 
6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock 
promoters may be used (Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) 
Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105). These 
constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated
transfection (The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New
York NY, pp. 191-196). 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, polynucleotides encoding PMMM may be 
ligated into an adenovirus transcription/translation complex consisting of the late promoter and 
tripartite leader sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be
used to obtain infective virus which expresses PMMM in host cells (Logan, J. and T. Shenk (1984) 
Proc. Natl. Acad. Sci. USA 81:3655-3659). In addition, transcription enhancers, such as the Rous 
sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 
or EBV-based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes (Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355).

For long term production of recombinant proteins in mammalian systems, stable expression 
of PMMM in cell lines is preferred. For example, polynucleotides encoding PMMM can be 
transformed into cell lines using expression vectors which may contain viral origins of replication 
and/or endogenous expression elements and a selectable marker gene on the same or on a separate 
vector. Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days 
in enriched media before being switched to selective media. The purpose of the selectable marker is 
to confer resistance to a selective agent, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed cells may be 
propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in tk⁻ and apr⁻ cells, respectively (Wigler, M. et al. (1977)
Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823). Also, antimetabolite, antibiotic, or
herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to
methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively (Wigler, M. et
al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 
150:1-14). Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular
requirements for metabolites (Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA 
85:8047-8051). Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech),
β-glucuronidase and its substrate β-glucuronide, or luciferase and its substrate luciferin may be used.
These markers can be used not only to identify transformants, but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector system (Rhodes, C.A. (1995) 
Methods Mol. Biol. 55:121-131). 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding PMMM is inserted within a marker gene sequence, transformed cells containing 
polynucleotides encoding PMMM can be identified by the absence of marker gene function. 
Alternatively, a marker gene can be placed in tandem with a sequence encoding PMMM under the 
control of a single promoter. Expression of the marker gene in response to induction or selection
usually indicates expression of the tandem gene as well. 

In general, host cells that contain the polynucleotide encoding PMMM and that express 
PMMM may be identified by a variety of procedures known to those of skill in the art. These
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of PMMM using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on PMMM is preferred, but a 
competitive binding assay may be employed. These and other assays are well known in the art 
(Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual, APS Press, St. Paul MN, Sect.
IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology, Greene Pub. Associates and Wiley-
Interscience, New York NY; Pound, J.D. (1998) Immunochemical Protocols, Humana Press, Totowa
NJ). 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding PMMM
include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide.
Alternatively, polynucleotides encoding PMMM, or any fragments thereof, may be cloned into a
vector for the production of an mRNA probe. Such vectors are known in the art, are commercially
available, and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA
polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted
using a variety of commercially available kits, such as those provided by Amersham Biosciences,
Promega (Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be 
used for ease of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or 
chromogenic agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with polynucleotides encoding PMMM may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracellularly depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors 
containing polynucleotides which encode PMMM may be designed to contain signal sequences which 
direct secretion of PMMM through a prokaryotic or eukaryotic cell membrane.

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted polynucleotides or to process the expressed protein in the desired fashion. Such 
modifications of the polypeptide include, but are not limited to, acetylation, carboxylation, 
glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing which cleaves 
a "prepro" or "pro" form of the protein may also be used to specify protein targeting, folding, and/or 
activity. Different host cells which have specific cellular machinery and characteristic mechanisms 
for post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the 
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant polynucleotides 
encoding PMMM may be ligated to a heterologous sequence resulting in translation of a fusion 
protein in any of the aforementioned host systems. For example, a chimeric PMMM protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of PMMM activity. Heterologous protein 
and peptide moieties may also facilitate purification of fusion proteins using commercially available 
affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), 
maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, 
c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their
cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and
metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity
purification of fusion proteins using commercially available monoclonal and polyclonal antibodies
that specifically recognize these epitope tags. A fusion protein may also be engineered to contain a
proteolytic cleavage site located between the PMMM encoding sequence and the heterologous protein
sequence, so that PMMM may be cleaved away from the heterologous moiety following purification.
Methods for fusion protein expression and purification are discussed in Ausubel et al. (supra, ch. 10
and 16). A variety of commercially available kits may also be used to facilitate expression and 
purification of fusion proteins. 

In another embodiment, synthesis of radiolabeled PMMM may be achieved in vitro using the 
TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems couple 
transcription and translation of protein-coding sequences operably associated with the T7, T3, or SP6 
promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for 
example, ³⁵S-methionine.

PMMM, fragments of PMMM, or variants of PMMM may be used to screen for compounds 
that specifically bind to PMMM. One or more test compounds may be screened for specific binding 
to PMMM. In various embodiments, 1, 2, 3, 4, 5, 10, 20, 50, 100, or 200 test compounds can be
screened for specific binding to PMMM. Examples of test compounds can include antibodies, 
anticalins, oligonucleotides, proteins (e.g., ligands or receptors), or small molecules. 

In related embodiments, variants of PMMM can be used to screen for binding of test 
compounds, such as antibodies, to PMMM, a variant of PMMM, or a combination of PMMM and/or 
one or more variants of PMMM. In an embodiment, a variant of PMMM can be used to screen for
compounds that bind to a variant of PMMM, but not to PMMM having the exact sequence of a 
sequence of SEQ ID NO: 1-58. PMMM variants used to perform such screening can have a range of 
about 50% to about 99% sequence identity to PMMM, with various embodiments having 60%, 70%, 
75%, 80%, 85%, 90%, and 95% sequence identity. 
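
As a non-limiting sketch of how the recited sequence identity figures might be computed, the following Python fragment scores two sequences that have already been aligned. The function names, the gap symbol "-", the choice to count identical residues over all aligned columns, and the example peptides are assumptions made for illustration; other identity conventions exist and may give different values.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over aligned columns of two pre-aligned sequences.

    Columns in which both sequences carry a gap are ignored; a gap paired
    with a residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def in_variant_range(identity, low=50.0, high=99.0):
    """True if the identity falls within the about 50% to about 99% range."""
    return low <= identity <= high


if __name__ == "__main__":
    # Hypothetical ten-residue peptides differing at two positions.
    identity = percent_identity("MKTAYIAKQR", "MKTGYIAKQK")
    print(identity, in_variant_range(identity))
```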

In an embodiment, a compound identified in a screen for specific binding to PMMM can be
closely related to the natural ligand of PMMM, e.g., a ligand or fragment thereof, a natural substrate, 
a structural or functional mimetic, or a natural binding partner (Coligan, J.E. et al. (1991) Current 
Protocols in Immunology l(2):Chapter 5). In another embodiment, the compound thus identified can 
be a natural ligand of a receptor PMMM (Howard, A.D. et al. (2001) Trends Pharmacol. Sci. 22:132-
140; Wise, A. et al. (2002) Drug Discovery Today 7:235-246).

In other embodiments, a compound identified in a screen for specific binding to PMMM can 
be closely related to the natural receptor to which PMMM binds, at least a fragment of the receptor, 
or a fragment of the receptor including all or a portion of the ligand binding site or binding pocket. 
For example, the compound may be a receptor for PMMM which is capable of propagating a signal, 
or a decoy receptor for PMMM which is not capable of propagating a signal (Ashkenazi, A. and V.M. 
Dixit (1999) Curr. Opin. Cell Biol. 11:255-260; Mantovani, A. et al. (2001) Trends Immunol. 22:328-
336). The compound can be rationally designed using known techniques. Examples of such 
techniques include those used to construct the compound etanercept (ENBREL; Amgen Inc.,
Thousand Oaks CA), which is efficacious for treating rheumatoid arthritis in humans. Etanercept is
an engineered p75 tumor necrosis factor (TNF) receptor dimer linked to the Fc portion of human IgG1
(Taylor, P.C. et al. (2001) Curr. Opin. Immunol. 13:611-616).

In one embodiment, two or more antibodies having similar or, alternatively, different 
specificities can be screened for specific binding to PMMM, fragments of PMMM, or variants of
PMMM. The binding specificity of the antibodies thus screened can thereby be selected to identify
particular fragments or variants of PMMM. In one embodiment, an antibody can be selected such
that its binding specificity allows for preferential identification of specific fragments or variants of 
PMMM. In another embodiment, an antibody can be selected such that its binding specificity allows 
for preferential diagnosis of a specific disease or condition having increased, decreased, or otherwise 
abnormal production of PMMM. 

In an embodiment, anticalins can be screened for specific binding to PMMM, fragments of
PMMM, or variants of PMMM. Anticalins are ligand-binding proteins that have been constructed
based on a lipocalin scaffold (Weiss, G.A. and H.B. Lowman (2000) Chem. Biol. 7:R177-R184; 
Skerra, A. (2001) J. Biotechnol. 74:257-275). The protein architecture of lipocalins can include a 
beta-barrel having eight antiparallel beta-strands, which supports four loops at its open end. These 
loops form the natural ligand-binding site of the lipocalins, a site which can be re-engineered in vitro 
by amino acid substitutions to impart novel binding specificities. The amino acid substitutions can be
made using methods known in the art or described herein, and can include conservative substitutions
(e.g., substitutions that do not alter binding specificity) or substitutions that modestly, moderately, or 
significantly alter binding specificity. 

In one embodiment, screening for compounds which specifically bind to, stimulate, or inhibit
PMMM involves producing appropriate cells which express PMMM, either as a secreted protein or
on the cell membrane. Preferred cells can include cells from mammals, yeast, Drosophila, or E. coli.
Cells expressing PMMM or cell membrane fractions which contain PMMM are then contacted with a 
test compound and binding, stimulation, or inhibition of activity of either PMMM or the compound is 
analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 
the assay may comprise the steps of combining at least one test compound with PMMM, either in 
solution or affixed to a solid support, and detecting the binding of PMMM to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical
libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a
solid support. 

An assay can be used to assess the ability of a compound to bind to its natural ligand and/or
to inhibit the binding of its natural ligand to its natural receptors. Examples of such assays include 
radio-labeling assays such as those described in U.S. Patent No. 5,914,236 and U.S. Patent No. 
6,372,724. In a related embodiment, one or more amino acid substitutions can be introduced into a 
polypeptide compound (such as a receptor) to improve or alter its ability to bind to its natural ligands
(Matthews, D.J. and J.A. Wells (1994) Chem. Biol. 1:25-30). In another related embodiment, one or
more amino acid substitutions can be introduced into a polypeptide compound (such as a ligand) to 
improve or alter its ability to bind to its natural receptors (Cunningham, B.C. and J.A. Wells (1991)
Proc. Natl. Acad. Sci. USA 88:3407-3411; Lowman, H.B. et al. (1991) J. Biol. Chem. 266:10982- 
10988). 

PMMM, fragments of PMMM, or variants of PMMM may be used to screen for compounds 
that modulate the activity of PMMM. Such compounds may include agonists, antagonists, or partial 
or inverse agonists. In one embodiment, an assay is performed under conditions permissive for 
PMMM activity, wherein PMMM is combined with at least one test compound, and the activity of 
PMMM in the presence of a test compound is compared with the activity of PMMM in the absence of 
the test compound. A change in the activity of PMMM in the presence of the test compound is 
indicative of a compound that modulates the activity of PMMM. Alternatively, a test compound is 
combined with an in vitro or cell-free system comprising PMMM under conditions suitable for
PMMM activity, and the assay is performed. In either of these assays, a test compound which 
modulates the activity of PMMM may do so indirectly and need not come in direct contact with
PMMM. At least one and up to a plurality of test compounds may be screened.
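
The comparison of PMMM activity in the presence and absence of a test compound can be sketched, for illustration only, as a simple ratio test. The function names, the activity units, and the 20% threshold used to call a change are assumptions of this sketch; the disclosure does not specify a threshold, and a real assay would set one from replicate measurements and controls.

```python
def activity_ratio(activity_with, activity_without):
    """Ratio of PMMM activity with the test compound to activity without it."""
    if activity_without <= 0:
        raise ValueError("control activity must be positive")
    return activity_with / activity_without


def classify_compound(activity_with, activity_without, threshold=0.2):
    """Label a compound by the direction of the activity change.

    `threshold` is the assumed minimum fractional change (0.2 = 20%) that is
    treated as a real change rather than assay noise.
    """
    ratio = activity_ratio(activity_with, activity_without)
    if ratio >= 1 + threshold:
        return "increases activity"
    if ratio <= 1 - threshold:
        return "decreases activity"
    return "no change detected"


if __name__ == "__main__":
    # Hypothetical readings in arbitrary units: (with compound, without).
    for name, pair in {"cmpd-A": (150, 100), "cmpd-B": (40, 100)}.items():
        print(name, classify_compound(*pair))
```

Applying the same function to each member of a compound collection corresponds to screening a plurality of test compounds.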

In another embodiment, polynucleotides encoding PMMM or their mammalian homologs 
may be "knocked out" in an animal model system using homologous recombination in embryonic 
stem (ES) cells. Such techniques are well known in the art and are useful for the generation of animal
models of human disease (see, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337). For
example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse 
embryo and grown in culture. The ES cells are transformed with a vector containing the gene of 
interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. 
(1989) Science 244:1288-1292). The vector integrates into the corresponding region of the host
genome by homologous recombination. Alternatively, homologous recombination takes place using 
the Cre-loxP system to knockout a gene of interest in a tissue- or developmental stage-specific 
manner (Marth, J.D. (1996) Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids
Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell
blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred 
to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce 
heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential 
therapeutic or toxic agents. 

Polynucleotides encoding PMMM may also be manipulated in vitro in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al.
(1998) Science 282:1145-1147).

Polynucleotides encoding PMMM can also be used to create "knockin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 
region of a polynucleotide encoding PMMM is injected into animal ES cells, and the injected 
sequence integrates into the animal cell genome. Transformed cells are injected into blastulae, and 
the blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
Alternatively, a mammal inbred to overexpress PMMM, e.g., by secreting PMMM in its milk, may 
also serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-
74).

THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of PMMM and protein modification and maintenance molecules. In addition, 
examples of tissues expressing PMMM can be found in Table 6 and can also be found in Example XI.
Therefore, PMMM appears to play a role in gastrointestinal, cardiovascular, 
autoimmune/inflammatory, cell proliferative, developmental, epithelial, neurological, and 
reproductive disorders. In the treatment of disorders associated with increased PMMM expression or 
activity, it is desirable to decrease the expression or activity of PMMM. In the treatment of disorders 
associated with decreased PMMM expression or activity, it is desirable to increase the expression or 
activity of PMMM. 

Therefore, in one embodiment, PMMM or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of PMMM. Examples of such disorders include, but are not limited to, a gastrointestinal 
disorder, such as dysphagia, peptic esophagitis, esophageal spasm, esophageal stricture, esophageal 
carcinoma, dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea, emesis,
gastroparesis, antral or pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal
obstruction, infections of the intestinal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, 
pancreatitis, pancreatic carcinoma, biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, 
passive congestion of the liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, 
Crohn's disease, Whipple's disease, Mallory-Weiss syndrome, colonic carcinoma, colonic 
obstruction, irritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal 
hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic 
encephalopathy, hepatorenal syndrome, hepatic steatosis, hemochromatosis, Wilson's disease,
alpha1-antitrypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, liver infarction, portal vein
obstruction and thrombosis, centrilobular necrosis, peliosis hepatis, hepatic vein thrombosis, veno- 
occlusive disease, preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of 
pregnancy, and hepatic tumors including nodular hyperplasias, adenomas, and carcinomas; a 
cardiovascular disorder, such as arteriovenous fistula, atherosclerosis, hypertension, vasculitis, 
Raynaud's disease, aneurysms, arterial dissections, varicose veins, thrombophlebitis and 
phlebothrombosis, vascular tumors, and complications of thrombolysis, balloon angioplasty, vascular 
replacement, and coronary artery bypass graft surgery, congestive heart failure, ischemic heart 
disease, angina pectoris, myocardial infarction, hypertensive heart disease, degenerative valvular 
heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, mitral annular 
calcification, mitral valve prolapse, rheumatic fever and rheumatic heart disease, infective 
endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus erythematosus, 
carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart disease, 
congenital heart disease, and complications of cardiac transplantation; an autoimmune/inflammatory
disease, such as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory 
distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, 
atherosclerotic plaque rupture, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema,
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's 
thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, degradation of articular cartilage, osteoporosis,
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's 
syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic 
purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and
extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and
trauma; a cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis,
cirrhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, colon, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, 
penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; a developmental 
disorder, such as renal tubular acidosis, anemia, Cushing's syndrome, achondroplastic dwarfism, 
Duchenne and Becker muscular dystrophy, bone resorption, epilepsy, gonadal dysgenesis, WAGR 
syndrome (Wilms' tumor, aniridia, genitourinary abnormalities, and mental retardation), Smith- 
Magenis syndrome, myelodysplastic syndrome, hereditary mucoepithelial dysplasia, hereditary 
keratodermas, hereditary neuropathies such as Charcot-Marie-Tooth disease and neurofibromatosis,
hypothyroidism, hydrocephalus, seizure disorders such as Sydenham's chorea and cerebral palsy,
spina bifida, anencephaly, craniorachischisis, congenital glaucoma, cataract, age-related macular 
degeneration, and sensorineural hearing loss; an epithelial disorder, such as dyshidrotic eczema, 
allergic contact dermatitis, keratosis pilaris, melasma, vitiligo, actinic keratosis, basal cell carcinoma,
squamous cell carcinoma, seborrheic keratosis, folliculitis, herpes simplex, herpes zoster, varicella, 
candidiasis, dermatophytosis, scabies, insect bites, cherry angioma, keloid, dermatofibroma, 
acrochordons, urticaria, transient acantholytic dermatosis, xerosis, eczema, atopic dermatitis, contact 
dermatitis, hand eczema, nummular eczema, lichen simplex chronicus, asteatotic eczema, stasis 
dermatitis and stasis ulceration, seborrheic dermatitis, psoriasis, lichen planus, pityriasis rosea, 
impetigo, ecthyma, dermatophytosis, tinea versicolor, warts, acne vulgaris, acne rosacea, pemphigus 
vulgaris, pemphigus foliaceus, paraneoplastic pemphigus, bullous pemphigoid, herpes gestationis, 
dermatitis herpetiformis, linear IgA disease, epidermolysis bullosa acquisita, dermatomyositis, lupus 
erythematosus, scleroderma and morphea, erythroderma, alopecia, figurate skin lesions, 
telangiectasias, hypopigmentation, hyperpigmentation, vesicles/bullae, exanthems, cutaneous drug 
reactions, papulonodular skin lesions, chronic non-healing wounds, photosensitivity diseases, 
epidermolysis bullosa simplex, epidermolytic hyperkeratosis, epidermolytic and nonepidermolytic 
palmoplantar keratoderma, ichthyosis bullosa of Siemens, ichthyosis exfoliativa, keratosis palmaris et 
plantaris, keratosis palmoplantaris, palmoplantar keratoderma, keratosis punctata, Meesmann's 
corneal dystrophy, pachyonychia congenita, white sponge nevus, steatocystoma multiplex, epidermal 
nevi/epidermolytic hyperkeratosis type, monilethrix, trichothiodystrophy, chronic 
hepatitis/cryptogenic cirrhosis, and colorectal hyperplasia; a neurological disorder, such as epilepsy, 
ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, 
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, amyotrophic 
lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, retinitis 
pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial and 
viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases 
including kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal 
familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, 
tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental 
retardation and other developmental disorders of the central nervous system including Down 
syndrome, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve 
disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, peripheral 
nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, endocrine, and 
toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including mood, anxiety, 
and schizophrenic disorders, seasonal affective disorder (SAD), akathisia, amnesia, catatonia, 
diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, 
Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, and familial 
frontotemporal dementia; and a reproductive disorder, such as infertility, including tubal disease, 
ovulatory defects, and endometriosis, a disorder of prolactin production, a disruption of the estrous 
cycle, a disruption of the menstrual cycle, polycystic ovary syndrome, ovarian hyperstimulation 
syndrome, an endometrial or ovarian tumor, a uterine fibroid, autoimmune disorders, an ectopic 
pregnancy, and teratogenesis; cancer of the breast, fibrocystic breast disease, and galactorrhea; a 
disruption of spermatogenesis, abnormal sperm physiology, cancer of the testis, cancer of the 
prostate, benign prostatic hyperplasia, prostatitis, Peyronie's disease, impotence, carcinoma of the 
male breast, and gynecomastia. 

In another embodiment, a vector capable of expressing PMMM or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of PMMM including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified PMMM in 
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent 
a disorder associated with decreased expression or activity of PMMM including, but not limited to, 
those provided above. 

In still another embodiment, an agonist which modulates the activity of PMMM may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of PMMM including, but not limited to, those listed above. 

In a further embodiment, an antagonist of PMMM may be administered to a subject to treat or 
prevent a disorder associated with increased expression or activity of PMMM. Examples of such 
disorders include, but are not limited to, those gastrointestinal, cardiovascular, 
autoimmune/inflammatory, cell proliferative, developmental, epithelial, neurological, and 
reproductive disorders described above. In one aspect, an antibody which specifically binds PMMM 
may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a 
pharmaceutical agent to cells or tissues which express PMMM. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding PMMM may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of PMMM including, but not limited to, those described above. 

In other embodiments, any protein, agonist, antagonist, antibody, complementary sequence, 
or vector embodiments may be administered in combination with other appropriate therapeutic 
agents. Selection of the appropriate agents for use in combination therapy may be made by one of 
ordinary skill in the art, according to conventional pharmaceutical principles. The combination of 
therapeutic agents may act synergistically to effect the treatment or prevention of the various 
disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with 
lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of PMMM may be produced using methods which are generally known in the 
art. In particular, purified PMMM may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind PMMM. Antibodies to PMMM may 
also be generated using methods that are well known in the art. Such antibodies may include, but are 
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 
fragments produced by a Fab expression library. In an embodiment, neutralizing antibodies (i.e., 
those which inhibit dimer formation) can be used therapeutically. Single chain antibodies (e.g., from 
camels or llamas) may be potent enzyme inhibitors and may have application in the design of peptide 
mimetics, and in the development of immuno-adsorbents and biosensors (Muyldermans, S. (2001) J. 
Biotechnol. 74:277-302). 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, camels, 
dromedaries, llamas, humans, and others may be immunized by injection with PMMM or with any 
fragment or oligopeptide thereof which has immunogenic properties. Depending on the host species, 
various adjuvants may be used to increase immunological response. Such adjuvants include, but are 
not limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances such 
as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. 
Among adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum are 
especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
PMMM have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 
fragments are substantially identical to a portion of the amino acid sequence of the natural protein. 
Short stretches of PMMM amino acids may be fused with those of another protein, such as KLH, and 
antibodies to the chimeric molecule may be produced. 

Monoclonal antibodies to PMMM may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
technique (Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. Immunol. 
Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; Cole, S.P. et al. 
(1984) Mol. Cell Biol. 62:109-120). 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used (Morrison, S.L. et al. (1984) Proc. Natl. Acad. 
Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; Takeda, S. et al. (1985) 
Nature 314:452-454). Alternatively, techniques described for the production of single chain 
antibodies may be adapted, using methods known in the art, to produce PMMM-specific single chain 
antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
generated by chain shuffling from random combinatorial immunoglobulin libraries (Burton, D.R. 
(1991) Proc. Natl. Acad. Sci. USA 88:10134-10137). 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature (Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter, 
G. et al. (1991) Nature 349:293-299). 

Antibody fragments which contain specific binding sites for PMMM may also be generated. 
For example, such fragments include, but are not limited to, F(ab')2 fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity (Huse, W.D. et al. (1989) 
Science 246:1275-1281). 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between PMMM and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering PMMM epitopes is generally used, but a competitive binding assay 
may also be employed (Pound, supra). 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay 
techniques may be used to assess the affinity of antibodies for PMMM. Affinity is expressed as an 
association constant, Ka, which is defined as the molar concentration of PMMM-antibody complex 
divided by the molar concentrations of free antigen and free antibody under equilibrium conditions. 
The Ka determined for a preparation of polyclonal antibodies, which are heterogeneous in their 
affinities for multiple PMMM epitopes, represents the average affinity, or avidity, of the antibodies 
for PMMM. The Ka determined for a preparation of monoclonal antibodies, which are monospecific 
for a particular PMMM epitope, represents a true measure of affinity. High-affinity antibody 
preparations with Ka ranging from about 10^9 to 10^12 L/mole are preferred for use in immunoassays in 
which the PMMM-antibody complex must withstand rigorous manipulations. Low-affinity antibody 
preparations with Ka ranging from about 10^6 to 10^7 L/mole are preferred for use in 
immunopurification and similar procedures which ultimately require dissociation of PMMM, 
preferably in active form, from the antibody (Catty, D. (1988) Antibodies, Volume I: A Practical 
Approach, IRL Press, Washington DC; Liddell, J.E. and A. Cryer (1991) A Practical Guide to 
Monoclonal Antibodies, John Wiley & Sons, New York NY). 
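
The definition of the association constant given above can be illustrated with a short calculation. The following Python sketch is offered only as an illustration and is not part of the disclosed methods; the function names and the sample concentrations are hypothetical. It computes Ka from equilibrium concentrations and, as one assumed way of carrying out a Scatchard analysis, estimates Ka as the negative slope of a least-squares line fitted to bound/free versus bound.

```python
def association_constant(complex_molar, free_antigen_molar, free_antibody_molar):
    """Ka = [complex] / ([free antigen] * [free antibody]), in L/mole."""
    if free_antigen_molar <= 0 or free_antibody_molar <= 0:
        raise ValueError("free concentrations must be positive")
    return complex_molar / (free_antigen_molar * free_antibody_molar)


def scatchard_ka(bound_molar, free_molar):
    """Estimate Ka as the negative slope of bound/free plotted against bound.

    Assumes a single class of binding sites, so that
    bound/free = Ka * (Bmax - bound) is a straight line.
    """
    ratios = [b / f for b, f in zip(bound_molar, free_molar)]
    n = len(bound_molar)
    mean_x = sum(bound_molar) / n
    mean_y = sum(ratios) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(bound_molar, ratios))
    sxx = sum((x - mean_x) ** 2 for x in bound_molar)
    return -sxy / sxx


if __name__ == "__main__":
    # Hypothetical equilibrium concentrations, all in mol/L.
    print(association_constant(1e-9, 1e-9, 1e-9))
```

For a polyclonal preparation, the single-site assumption in scatchard_ka does not hold strictly, so the fitted value would be read as an average affinity in the sense described above.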

The titer and avidity of polyclonal antibody preparations may be further evaluated to 
determine the quality and suitability of such preparations for certain downstream applications. For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/ml, is generally employed in procedures requiring precipitation 
of PMMM-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, and 
guidelines for antibody quality and usage in various applications, are generally available (Catty, 
supra; Coligan et al., supra). 

In another embodiment of the invention, polynucleotides encoding PMMM, or any fragment 
or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, 
RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding 
PMMM. Such technology is well known in the art, and antisense oligonucleotides or larger 
fragments can be designed from various locations along the coding or control regions of sequences 
encoding PMMM (Agrawal, S., ed. (1996) Antisense Therapeutics, Humana Press, Totowa NJ). 
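
By way of illustration only, the design of a complementary sequence to a chosen region of a transcript can be sketched in Python as follows. The function name, the example sequence, and the choice of region are hypothetical and are not taken from the sequences of the invention; the sketch simply returns the reverse complement of the selected stretch of mRNA, written 5' to 3'.

```python
# Watson-Crick complement table for RNA bases.
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(mrna, start, length):
    """Return the antisense RNA (5'->3') for mrna[start:start+length].

    start is a 0-based index into the transcript; DNA input (T) is
    converted to RNA (U) before complementing.
    """
    region = mrna.upper().replace("T", "U")[start:start + length]
    if len(region) != length:
        raise ValueError("requested region extends past the end of the transcript")
    return region.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Hypothetical transcript fragment, used only to show the call.
    print(antisense_rna("AUGGCCAAGUUCGGA", 0, 12))
```

Whether a given region is accessible to hybridization in a cell is not addressed by this sketch and would be assessed by the methods described elsewhere herein.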

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein (Slater, J.E. et 
al. (1998) J. Allergy Clin. Immunol. 102:469-475; Scanlon, K.J. et al. (1995) 9:1288-1296). 
Antisense sequences can also be introduced intracellularly through the use of viral vectors, such as 
retrovirus and adeno-associated virus vectors (Miller, A.D. (1990) Blood 76:271; Ausubel et al., 
supra; Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63:323-347). Other gene delivery 
mechanisms include liposome-derived systems, artificial viral envelopes, and other systems known in 
the art (Rossi, J.J. (1995) Br. Med. Bull. 51:217-225; Boado, R.J. et al. (1998) J. Pharm. Sci. 
87:1308-1315; Morris, M.C. et al. (1997) Nucleic Acids Res. 25:2730-2736). 

In another embodiment of the invention, polynucleotides encoding PMMM may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCID)-X1 disease characterized by X- 
linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassemias, familial 
hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites 
(e.g., against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. 
(1988) Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA 93:11395-11399), 
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzi). In the 
case where a genetic deficiency in PMMM expression or regulation causes disease, the expression of 
PMMM from an appropriate population of transduced cells may alleviate the clinical manifestations 
caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 
PMMM are treated by constructing mammalian expression vectors encoding PMMM and introducing 
these vectors by mechanical means into PMMM-deficient cells. Mechanical transfer technologies for 
use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) 
ballistic gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene 
transfer, and (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. 
Biochem. 62:191-217; Ivics, Z. (1997) Cell 91:501-510; Boulay, J.-L. and H. Recipon (1998) Curr. 
Opin. Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of PMMM include, but are not 
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-TOPOTA vectors 
(Invitrogen, Carlsbad CA), PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), 
and PTET-OFF, PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). PMMM 
may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), 
Rous sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or β-actin genes), (ii) an inducible 
promoter (e.g., the tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. 
Acad. Sci. USA 89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and 
H.M. Blau (1998) Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid 
(Invitrogen)); the ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; 
Invitrogen); the FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter 
(Rossi, F.M.V. and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the native promoter of 
the endogenous gene encoding PMMM from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of 
these standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to PMMM expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding PMMM under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus cis-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in 
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent No. 5,910,434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") 
discloses a method for obtaining retrovirus packaging cell lines and is hereby incorporated by 
reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4+ T-cells), 
and the return of transduced cells to a patient are procedures well known to persons skilled in 
the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; 
Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; 
Ranga, U. et al. (1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

In an embodiment, an adenovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding PMMM to cells which have one or more genetic abnormalities with respect 
to the expression of PMMM. The construction and packaging of adenovirus-based vectors are well 
known to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to 
be versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent No. 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P. A. et al. (1999; Annu. 
Rev. Nutr. 19:511-544) and Verma, I.M. and N. Somia (1997; Nature 389:239-242). 

In another embodiment, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding PMMM to target cells which have one or more genetic abnormalities with 
respect to the expression of PMMM. The use of herpes simplex virus (HSV)-based vectors may be 
especially valuable for introducing PMMM to cells of the central nervous system, for which HSV has 
a tropism. The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1-based vector has 
been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res. 
169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent No. 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is hereby 
incorporated by reference. U.S. Patent No. 5,804,413 teaches the use of recombinant HSV d92 which 
consists of a genome containing at least one exogenous gene to be transferred to a cell under the 
control of the appropriate promoter for purposes including human gene therapy. Also taught by this 
patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22. 
For HSV vectors, see also Goins, W.F. et al. (1999; J. Virol. 73:519-532) and Xu, H. et al. (1994; 
Dev. Biol. 163:152-161). The manipulation of cloned herpesvirus sequences, the generation of 
recombinant virus following the transfection of multiple plasmids containing different segments of 
the large herpesvirus genomes, the growth and propagation of herpesvirus, and the infection of cells 
with herpesvirus are techniques well known to those of ordinary skill in the art. 

In another embodiment, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding PMMM to target cells. The biology of the prototypic alphavirus, 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 
on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA, 
resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity 
(e.g., protease and polymerase). Similarly, inserting the coding sequence for PMMM into the 
alphavirus genome in place of the capsid-coding region results in the production of a large number of 
PMMM-coding RNAs and the synthesis of high levels of PMMM in vector transduced cells. While 
alphavirus infection is typically associated with cell lysis within a few days, the ability to establish a 
persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) 
indicates that the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy 
application (Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will 
allow the introduction of PMMM into a variety of cell types. The specific transduction of a subset of 
cells in a population may require the sorting of cells prior to transduction. The methods of 
manipulating infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA 
transfections, and performing alphavirus infections, are well known to those with ordinary skill in the 
art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions 
-10 and +10 from the start site, may also be employed to inhibit gene expression. Similarly, 
inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful 
because it causes inhibition of the ability of the double helix to open sufficiently for the binding of 
polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances using 
triplex DNA have been described in the literature (Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, 
Molecular and Immunologic Approaches, Futura Publishing, Mt. Kisco NY, pp. 163-177). A 
complementary sequence or antisense molecule may also be designed to block translation of mRNA 
by preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of RNA molecules encoding PMMM. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 
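
The scanning step described above lends itself to a short worked example. The following Python sketch is illustrative only: the function names are hypothetical, and the default window of 17 ribonucleotides with the cleavage triplet placed near its center is an assumption chosen from within the stated range of 15 to 20. It locates GUA, GUU, and GUC triplets in a target RNA and returns the surrounding short sequence for each site. Evaluation of secondary structure and of accessibility to hybridization is not attempted here.

```python
CLEAVAGE_TRIPLETS = ("GUA", "GUU", "GUC")


def find_cleavage_sites(rna, triplets=CLEAVAGE_TRIPLETS):
    """Return (index, triplet) for every candidate cleavage triplet in rna."""
    rna = rna.upper().replace("T", "U")  # accept DNA-style input
    return [(i, rna[i:i + 3])
            for i in range(len(rna) - 2)
            if rna[i:i + 3] in triplets]


def candidate_windows(rna, window=17):
    """Return (index, triplet, window_sequence) for sites with full flanks.

    The triplet is placed roughly at the center of the window; sites too
    close to either end of the molecule are skipped.
    """
    if not 15 <= window <= 20:
        raise ValueError("window should be between 15 and 20 ribonucleotides")
    rna = rna.upper().replace("T", "U")
    flank = (window - 3) // 2
    results = []
    for i, triplet in find_cleavage_sites(rna):
        start = i - flank
        end = start + window
        if start < 0 or end > len(rna):
            continue
        results.append((i, triplet, rna[start:end]))
    return results


if __name__ == "__main__":
    # Hypothetical target fragment, used only to show the call.
    for site in candidate_windows("ACCAGCAUGGUCAAGGCUACGUUAGCCGAUAC"):
        print(site)
```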

Complementary ribonucleic acid molecules and ribozymes may be prepared by any method 
known in the art for the synthesis of nucleic acid molecules. These include techniques for chemically 
synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. Alternatively, 
RNA molecules may be generated by in vitro and in vivo transcription of DNA molecules encoding 
PMMM. Such DNA sequences may be incorporated into a wide variety of vectors with suitable RNA 
polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that synthesize 
complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' 
ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiester 
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of these molecules by the inclusion of nontraditional bases such as inosine, 
queuosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, 
cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous 
endonucleases. 

In other embodiments of the invention, the expression of one or more selected 
polynucleotides of the present invention can be altered, inhibited, decreased, or silenced using RNA 
interference (RNAi) or post-transcriptional gene silencing (PTGS) methods known in the art. RNAi 
is a post-transcriptional mode of gene silencing in which double-stranded RNA (dsRNA) introduced 
into a targeted cell specifically suppresses the expression of the homologous gene (i.e., the gene 
bearing the sequence complementary to the dsRNA). This effectively knocks out or substantially 
reduces the expression of the targeted gene. PTGS can also be accomplished by use of DNA or DNA 
fragments as well. RNAi methods are described by Fire, A. et al. (1998; Nature 391:806-811) and 
Gura, T. (2000; Nature 404:804-808). PTGS can also be initiated by introduction of a 
complementary segment of DNA into the selected tissue using gene delivery and/or viral vector 
delivery methods described herein or known in the art. 

RNAi can be induced in mammalian cells by the use of small interfering RNA, also known as 
siRNA. SiRNA are shorter segments of dsRNA (typically about 21 to 23 nucleotides in length) that 
result in vivo from cleavage of introduced dsRNA by the action of an endogenous ribonuclease. 
SiRNA appear to be the mediators of the RNAi effect in mammals. The most effective siRNAs 
appear to be 21 nucleotide dsRNAs with 2 nucleotide 3' overhangs. The use of siRNA for inducing 
RNAi in mammalian cells is described by Elbashir, S.M. et al. (2001; Nature 411:494-498). 
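
The geometry of a 21 nucleotide duplex with 2 nucleotide 3' overhangs can be made concrete with a small sketch. The following Python example is illustrative only and rests on stated assumptions: a 19 nucleotide paired core (21 minus the 2 nucleotide overhang) and a "UU" overhang on each strand, both of which are hypothetical choices rather than requirements of the invention. The function names are likewise hypothetical.

```python
# Watson-Crick complement table for RNA bases.
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna):
    """Return the reverse complement of an RNA sequence, written 5'->3'."""
    return rna.translate(_COMPLEMENT)[::-1]


def sirna_duplex(core, overhang="UU"):
    """Build (sense, antisense) 21-nt strands from a 19-nt target core.

    Each strand is written 5'->3'. The 19-nt cores pair with each other,
    and the 2-nt overhang sits at the 3' end of each strand.
    """
    core = core.upper().replace("T", "U")
    if len(core) != 19:
        raise ValueError("core must be 19 nucleotides long")
    if len(overhang) != 2:
        raise ValueError("overhang must be 2 nucleotides long")
    sense = core + overhang
    antisense = reverse_complement(core) + overhang
    return sense, antisense


if __name__ == "__main__":
    # Hypothetical 19-nt core sequence, used only to show the call.
    sense, antisense = sirna_duplex("GCAUGCAUGCAUGCAUGCA")
    print("sense    ", sense)
    print("antisense", antisense)
```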

SiRNA can either be generated indirectly by introduction of dsRNA into the targeted cell, or 
directly by mammalian transfection methods and agents described herein or known in the art (such as 
liposome-mediated transfection, viral vector methods, or other polynucleotide delivery/introductory 
methods). Suitable SiRNAs can be selected by examining a transcript of the target polynucleotide 
(e.g., mRNA) for nucleotide sequences downstream from the AUG start codon and recording the 
occurrence of each nucleotide and the 3' adjacent 19 to 23 nucleotides as potential siRNA target sites, 
with sequences having a 21 nucleotide length being preferred. Regions to be avoided for target 
siRNA sites include the 5' and 3' untranslated regions (UTRs) and regions near the start codon (within 
75 bases), as these may be richer in regulatory protein binding sites. UTR-binding proteins and/or 
translation initiation complexes may interfere with binding of the siRNP endonuclease complex. The
selected target sites for siRNA can then be compared to the appropriate genome database (e.g., 
human, etc.) using BLAST or other sequence comparison algorithms known in the art. Target 
sequences with significant homology to other coding sequences can be eliminated from consideration.
The selected siRNAs can be produced by chemical synthesis methods known in the art or by in vitro
transcription using commercially available methods and kits such as the SILENCER siRNA 
construction kit (Ambion, Austin TX). 
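
The site-selection steps above can be sketched in Python. This is a minimal illustration under stated assumptions, not a validated design tool: the function name `candidate_sirna_sites` and its parameters are hypothetical, the coding-region boundaries are assumed to be known in advance, and the homology filter (comparison against a genome database with BLAST or a similar algorithm) is omitted.

```python
def candidate_sirna_sites(mrna, cds_start, cds_end, site_len=21, skip_after_start=75):
    """Return (position, target sequence) pairs for candidate siRNA sites.

    mrna: transcript sequence (A/C/G/U), 5' to 3'.
    cds_start: 0-based index of the A in the AUG start codon.
    cds_end: 0-based index one past the last base of the stop codon.
    Sites lie wholly within the coding region, so both UTRs are avoided,
    and no site starts within skip_after_start bases of the start codon.
    """
    mrna = mrna.upper().replace("T", "U")
    if mrna[cds_start:cds_start + 3] != "AUG":
        raise ValueError("cds_start does not point at an AUG start codon")
    first = cds_start + skip_after_start
    last = cds_end - site_len  # last start position that keeps the site in the CDS
    return [(i, mrna[i:i + site_len]) for i in range(first, last + 1)]
```

The default window of 21 nucleotides reflects the preferred length stated above; `site_len` can be set anywhere in the 19 to 23 nucleotide range. Each returned sequence would still need to be screened for homology to other coding sequences before use.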

In alternative embodiments, long-term gene silencing and/or RNAi effects can be induced in 
selected tissue using expression vectors that continuously express siRNA. This can be accomplished
using expression vectors that are engineered to express hairpin RNAs (shRNAs) using methods
known in the art (see, e.g., Brummelkamp, T.R. et al. (2002) Science 296:550-553; and Paddison, P.J.
et al. (2002) Genes Dev. 16:948-958). In these and related embodiments, shRNAs can be delivered to
target cells using expression vectors known in the art. An example of a suitable expression vector for
delivery of siRNA is the PSILENCER1.0-U6 (circular) plasmid (Ambion). Once delivered to the
target tissue, shRNAs are processed in vivo into siRNA-like molecules capable of carrying out gene- 
specific silencing. 

In various embodiments, the expression levels of genes targeted by RNAi or PTGS methods 
can be determined by assays for mRNA and/or protein analysis. Expression levels of the mRNA of a 
targeted gene can be determined by northern analysis methods using, for example, the
NORTHERNMAX-GLY kit (Ambion); by microarray methods; by PCR methods; by real time PCR
methods; and by other RNA/polynucleotide assays known in the art or described herein. Expression 
levels of the protein encoded by the targeted gene can be determined by Western analysis using 
standard techniques known in the art. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding PMMM. 
Compounds which may be effective in altering expression of a specific polynucleotide may include, 
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming 
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and non- 
macromolecular chemical entities which are capable of interacting with specific polynucleotide 
sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors or 
promoters of polynucleotide expression. Thus, in the treatment of disorders associated with increased 
PMMM expression or activity, a compound which specifically inhibits expression of the 
polynucleotide encoding PMMM may be therapeutically useful, and in the treatment of disorders 
associated with decreased PMMM expression or activity, a compound which specifically promotes 
expression of the polynucleotide encoding PMMM may be therapeutically useful. 

In various embodiments, one or more test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding PMMM is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted
biochemical system. Alterations in the expression of a polynucleotide encoding PMMM are assayed 
by any method commonly known in the art. Typically, the expression of a specific nucleotide is 
detected by hybridization with a probe having a nucleotide sequence complementary to the sequence 
of the polynucleotide encoding PMMM. The amount of hybridization may be quantified, thus
forming the basis for a comparison of the expression of the polynucleotide both with and without 
exposure to one or more test compounds. Detection of a change in the expression of a polynucleotide 
exposed to a test compound indicates that the test compound is effective in altering the expression of 
the polynucleotide. A screen for a compound effective in altering expression of a specific 
polynucleotide can be carried out, for example, using a Schizosaccharomyces pombe gene expression
system (Atkins, D. et al. (1999) U.S. Patent No. 5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids
Res. 28:E15) or a human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. 
Res. Commun. 268:8-13). A particular embodiment of the present invention involves screening a 
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 
nucleic acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide 
sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. 
Patent No. 6,022,691). 
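
The comparison of hybridization signal with and without a test compound, described above, can be expressed as a simple ratio. The sketch below is illustrative only: the function names, the use of replicate means, and the twofold cutoff are assumptions made for the example and are not part of the described method.

```python
from statistics import mean

def expression_ratio(treated, untreated):
    """Ratio of mean hybridization signal with vs. without the test compound."""
    baseline = mean(untreated)
    if baseline <= 0:
        raise ValueError("untreated signal must be positive")
    return mean(treated) / baseline

def classify_compound(treated, untreated, fold=2.0):
    """Label a compound by the direction of the expression change.

    fold is an arbitrary illustrative cutoff, not a value from the text.
    """
    ratio = expression_ratio(treated, untreated)
    if ratio >= fold:
        return "promoter"
    if ratio <= 1.0 / fold:
        return "inhibitor"
    return "no effect detected"
```

In practice the cutoff would be chosen from the variability of the assay, and a statistical test on replicates would be preferable to a fixed ratio.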

Many methods for introducing vectors into cells or tissues are available and equally suitable 
for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art (Goldman, C.K. et al. (1997) Nat. Biotechnol. 15:462- 
466). 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a composition 
which generally comprises an active ingredient formulated with a pharmaceutically acceptable 
excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and proteins.
Various formulations are commonly known and are thoroughly discussed in the latest edition of
Remington's Pharmaceutical Sciences (Mack Publishing, Easton PA). Such compositions may
consist of PMMM, antibodies to PMMM, and mimetics, agonists, antagonists, or inhibitors of 
PMMM. 

In various embodiments, the compositions described herein, such as pharmaceutical 
compositions, may be administered by any number of routes including, but not limited to, oral, 
intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, pulmonary, 
transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means.

Compositions for pulmonary administration may be prepared in liquid or dry powder form.
These compositions are generally aerosolized immediately prior to inhalation by the patient. In the 
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of 
fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. larger peptides 
and proteins), recent developments in the field of pulmonary delivery via the alveolar region of the 
lung have enabled the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, 
J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery allows administration without needle
injection, and obviates the need for potentially toxic penetration enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve the intended purpose. The determination 
of an effective dose is well within the capability of those skilled in the art. 

Specialized forms of compositions may be prepared for direct intracellular delivery of 
macromolecules comprising PMMM or fragments thereof. For example, liposome preparations 
containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of 
the macromolecule. Alternatively, PMMM or a fragment thereof may be joined to a short cationic N- 
terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to 
transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et 
al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, 
monkeys, or pigs. An animal model may also be used to determine the appropriate concentration 
range and route of administration. Such information can then be used to determine useful doses and 
routes for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
PMMM or fragments thereof, antibodies of PMMM, and agonists, antagonists or inhibitors of 
PMMM, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be 
determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such
as by calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (the
dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the
therapeutic index, which can be expressed as the LD50/ED50 ratio. Compositions which exhibit large
therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are
used to formulate a range of dosage for human use. The dosage contained in such compositions is 
preferably within a range of circulating concentrations that includes the ED50 with little or no toxicity. 
The dosage varies within this range depending upon the dosage form employed, the sensitivity of the
patient, and the route of administration. 
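
As a worked illustration of the therapeutic index defined above, the ratio can be computed directly from the two statistics. The function name and the example doses are hypothetical; both doses are assumed to be expressed in the same units.

```python
def therapeutic_index(ld50, ed50):
    """Return the therapeutic index, LD50 / ED50 (both in the same dose units)."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("doses must be positive")
    return ld50 / ed50

# Hypothetical compound: lethal to 50% of animals at 500 mg/kg,
# therapeutically effective in 50% at 10 mg/kg.
index = therapeutic_index(500.0, 10.0)
```

A larger index indicates a wider margin between the effective and the toxic dose, which is why compositions with large therapeutic indices are preferred.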

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the 
active moiety or to maintain the desired effect. Factors which may be taken into account include the 
severity of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and 
response to therapy. Long-acting compositions may be administered every 3 to 4 days, every week, 
or biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 μg to 100,000 μg, up to a total dose of
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc.

DIAGNOSTICS

In another embodiment, antibodies which specifically bind PMMM may be used for the 
diagnosis of disorders characterized by expression of PMMM, or in assays to monitor patients being 
treated with PMMM or agonists, antagonists, or inhibitors of PMMM. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as described above for therapeutics. 
Diagnostic assays for PMMM include methods which utilize the antibody and a label to detect 
PMMM in human body fluids or in extracts of cells or tissues. The antibodies may be used with or 
without modification, and may be labeled by covalent or non-covalent attachment of a reporter 
molecule. A wide variety of reporter molecules, several of which are described above, are known in 
the art and may be used. 

A variety of protocols for measuring PMMM, including ELISAs, RIAs, and FACS, are
known in the art and provide a basis for diagnosing altered or abnormal levels of PMMM expression. 
Normal or standard values for PMMM expression are established by combining body fluids or cell 
extracts taken from normal mammalian subjects, for example, human subjects, with antibodies to 
PMMM under conditions suitable for complex formation. The amount of standard complex 
formation may be quantitated by various methods, such as photometric means. Quantities of PMMM 
expressed in subject, control, and disease samples from biopsied tissues are compared with the 
standard values. Deviation between standard and subject values establishes the parameters for 
diagnosing disease.

In another embodiment of the invention, polynucleotides encoding PMMM may be used for
diagnostic purposes. The polynucleotides which may be used include oligonucleotides, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect 
and quantify gene expression in biopsied tissues in which expression of PMMM may be correlated 
with disease. The diagnostic assay may be used to determine absence, presence, and excess 
expression of PMMM, and to monitor regulation of PMMM levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotides,
including genomic sequences, encoding PMMM or closely related molecules may be used to identify 
nucleic acid sequences which encode PMMM. The specificity of the probe, whether it is made from 
a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a conserved 
motif, and the stringency of the hybridization or amplification will determine whether the probe
identifies only naturally occurring sequences encoding PMMM, allelic variants, or related sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the PMMM encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:59-116 or
from genomic sequences including promoters, enhancers, and introns of the PMMM gene. 

Means for producing specific hybridization probes for polynucleotides encoding PMMM 
include the cloning of polynucleotides encoding PMMM or PMMM derivatives into vectors for the 
production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as ³²P or ³⁵S, or by enzymatic labels,
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotides encoding PMMM may be used for the diagnosis of disorders associated with 
expression of PMMM. Examples of such disorders include, but are not limited to, a gastrointestinal 
disorder, such as dysphagia, peptic esophagitis, esophageal spasm, esophageal stricture, esophageal 
carcinoma, dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea, emesis, 
gastroparesis, antral or pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal 
obstruction, infections of the intestinal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, 
pancreatitis, pancreatic carcinoma, biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, 
passive congestion of the liver, hepatoma, infectious colitis, ulcerative colitis, ulcerative proctitis, 
Crohn's disease, Whipple's disease, Mallory-Weiss syndrome, colonic carcinoma, colonic
obstruction, irritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal
hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic
encephalopathy, hepatorenal syndrome, hepatic steatosis, hemochromatosis, Wilson's disease,
alpha1-antitrypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, liver infarction, portal vein
obstruction and thrombosis, centrilobular necrosis, peliosis hepatis, hepatic vein thrombosis,
veno-occlusive disease, preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of
pregnancy, and hepatic tumors including nodular hyperplasias, adenomas, and carcinomas; a 
cardiovascular disorder, such as arteriovenous fistula, atherosclerosis, hypertension, vasculitis, 
Raynaud's disease, aneurysms, arterial dissections, varicose veins, thrombophlebitis and 
phlebothrombosis, vascular tumors, and complications of thrombolysis, balloon angioplasty, vascular 
replacement, and coronary artery bypass graft surgery, congestive heart failure, ischemic heart 
disease, angina pectoris, myocardial infarction, hypertensive heart disease, degenerative valvular 
heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, mitral annular 
calcification, mitral valve prolapse, rheumatic fever and rheumatic heart disease, infective 
endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus erythematosus, 
carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart disease, 
congenital heart disease, and complications of cardiac transplantation; an autoimmune/inflammatory
disease, such as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory 
distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, 
atherosclerotic plaque rupture, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's
thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, degradation of articular cartilage, osteoporosis, 
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's 
syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic
purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and 
extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and 
trauma; a cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis,
cirrhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, colon, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, 
penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; a developmental
disorder, such as renal tubular acidosis, anemia, Cushing's syndrome, achondroplastic dwarfism,
Duchenne and Becker muscular dystrophy, bone resorption, epilepsy, gonadal dysgenesis, WAGR
syndrome (Wilms' tumor, aniridia, genitourinary abnormalities, and mental retardation),
Smith-Magenis syndrome, myelodysplastic syndrome, hereditary mucoepithelial dysplasia, hereditary
keratodermas, hereditary neuropathies such as Charcot-Marie-Tooth disease and neurofibromatosis, 
hypothyroidism, hydrocephalus, seizure disorders such as Sydenham's chorea and cerebral palsy,
spina bifida, anencephaly, craniorachischisis, congenital glaucoma, cataract, age-related macular 
degeneration, and sensorineural hearing loss; an epithelial disorder, such as dyshidrotic eczema, 
allergic contact dermatitis, keratosis pilaris, melasma, vitiligo, actinic keratosis, basal cell carcinoma, 
squamous cell carcinoma, seborrheic keratosis, folliculitis, herpes simplex, herpes zoster, varicella, 
candidiasis, dermatophytosis, scabies, insect bites, cherry angioma, keloid, dermatofibroma, 
acrochordons, urticaria, transient acantholytic dermatosis, xerosis, eczema, atopic dermatitis, contact 
dermatitis, hand eczema, nummular eczema, lichen simplex chronicus, asteatotic eczema, stasis 
dermatitis and stasis ulceration, seborrheic dermatitis, psoriasis, lichen planus, pityriasis rosea, 
impetigo, ecthyma, dermatophytosis, tinea versicolor, warts, acne vulgaris, acne rosacea, pemphigus 
vulgaris, pemphigus foliaceus, paraneoplastic pemphigus, bullous pemphigoid, herpes gestationis, 
dermatitis herpetiformis, linear IgA disease, epidermolysis bullosa acquisita, dermatomyositis, lupus 
erythematosus, scleroderma and morphea, erythroderma, alopecia, figurate skin lesions, 
telangiectasias, hypopigmentation, hyperpigmentation, vesicles/bullae, exanthems, cutaneous drug 
reactions, papulonodular skin lesions, chronic non-healing wounds, photosensitivity diseases, 
epidermolysis bullosa simplex, epidermolytic hyperkeratosis, epidermolytic and nonepidermolytic 
palmoplantar keratoderma, ichthyosis bullosa of Siemens, ichthyosis exfoliativa, keratosis palmaris et 
plantaris, keratosis palmoplantaris, palmoplantar keratoderma, keratosis punctata, Meesmann's 
corneal dystrophy, pachyonychia congenita, white sponge nevus, steatocystoma multiplex, epidermal 
nevi/epidermolytic hyperkeratosis type, monilethrix, trichothiodystrophy, chronic 
hepatitis/cryptogenic cirrhosis, and colorectal hyperplasia; a neurological disorder, such as epilepsy,
ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease,
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, amyotrophic 
lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, retinitis 
pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial and 
viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases 
including kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal
familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, 
tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental 
retardation and other developmental disorders of the central nervous system including Down 
syndrome, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve 
disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, peripheral 
nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, endocrine, and 
toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including mood, anxiety, 
and schizophrenic disorders, seasonal affective disorder (SAD), akathesia, amnesia, catatonia, 
diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, 
Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, and familial 
frontotemporal dementia; and a reproductive disorder, such as infertility, including tubal disease, 
ovulatory defects, and endometriosis, a disorder of prolactin production, a disruption of the estrous 
cycle, a disruption of the menstrual cycle, polycystic ovary syndrome, ovarian hyperstimulation 
syndrome, an endometrial or ovarian tumor, a uterine fibroid, autoimmune disorders, an ectopic
pregnancy, and teratogenesis; cancer of the breast, fibrocystic breast disease, and galactorrhea; a 
disruption of spermatogenesis, abnormal sperm physiology, cancer of the testis, cancer of the 
prostate, benign prostatic hyperplasia, prostatitis, Peyronie's disease, impotence, carcinoma of the 
male breast, and gynecomastia. Polynucleotides encoding PMMM may be used in Southern or 
northern analysis, dot blot, or other membrane-based technologies; in PCR technologies; in dipstick, 
pin, and multiformat ELISA-like assays; and in microarrays utilizing fluids or tissues from patients to 
detect altered PMMM expression. Such qualitative or quantitative methods are well known in the art.

In a particular embodiment, polynucleotides encoding PMMM may be used in assays that 
detect the presence of associated disorders, particularly those mentioned above. Polynucleotides 
complementary to sequences encoding PMMM may be labeled by standard methods and added to a 
fluid or tissue sample from a patient under conditions suitable for the formation of hybridization 
complexes. After a suitable incubation period, the sample is washed and the signal is quantified and 
compared with a standard value. If the amount of signal in the patient sample is significantly altered
in comparison to a control sample then the presence of altered levels of polynucleotides encoding 
PMMM in the sample indicates the presence of the associated disorder. Such assays may also be 
used to evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in 
clinical trials, or to monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of 
PMMM, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding PMMM, under conditions suitable for hybridization or
amplification. Standard hybridization may be quantified by comparing the values obtained from 
normal subjects with values from an experiment in which a known amount of a substantially purified 
polynucleotide is used. Standard values obtained in this manner may be compared with values 
obtained from samples from patients who are symptomatic for a disorder. Deviation from standard 
values is used to establish the presence of a disorder. 
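
One way to express deviation from standard values is as a z-score relative to measurements from normal subjects. The text does not prescribe a particular statistic, so the choice of a z-score, the function name, and the example values below are all assumptions made for illustration.

```python
from statistics import mean, stdev

def deviation_from_standard(subject_value, normal_values):
    """Z-score of a subject measurement relative to values from normal subjects."""
    mu = mean(normal_values)
    sigma = stdev(normal_values)  # sample standard deviation; needs >= 2 values
    if sigma == 0:
        raise ValueError("normal values show no variation")
    return (subject_value - mu) / sigma

# Hypothetical hybridization signals (arbitrary units).
z = deviation_from_standard(16, [8, 10, 12])
```

How large a deviation counts as diagnostic would have to be established empirically for each assay and disorder.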

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the appearance 
of actual clinical symptoms. A more definitive diagnosis of this type may allow health professionals 
to employ preventative measures or aggressive treatment earlier, thereby preventing the development 
or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding 
PMMM may involve the use of PCR. These oligomers may be chemically synthesized, generated
enzymatically, or produced in vitro. Oligomers will preferably contain a fragment of a polynucleotide 
encoding PMMM, or a fragment of a polynucleotide complementary to the polynucleotide encoding 
PMMM, and will be employed under optimized conditions for identification of a specific gene or 
condition. Oligomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from polynucleotides encoding 
PMMM may be used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions, 
insertions and deletions that are a frequent cause of inherited or acquired genetic disease in humans. 
Methods of SNP detection include, but are not limited to, single-stranded conformation 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers 
derived from polynucleotides encoding PMMM are used to amplify DNA using the polymerase chain 
reaction (PCR). The DNA may be derived, for example, from diseased or normal tissue, biopsy
samples, bodily fluids, and the like. SNPs in the DNA cause differences in the secondary and tertiary
structures of PCR products in single-stranded form, and these differences are detectable using gel
electrophoresis in non-denaturing gels. In fSSCP, the oligonucleotide primers are fluorescently
labeled, which allows detection of the amplimers in high-throughput equipment such as DNA
sequencing machines. Additionally, sequence database analysis methods, termed in silico SNP
(isSNP), are capable of identifying polymorphisms by comparing the sequence of individual
overlapping DNA fragments which assemble into a common consensus sequence. These
computer-based methods filter out sequence variations due to laboratory preparation of DNA and sequencing
errors using statistical models and automated analyses of DNA sequence chromatograms. In the 
alternative, SNPs may be detected and characterized by mass spectrometry using, for example, the 
high throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 
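
The in silico approach described above, comparing overlapping fragments against a common consensus, can be sketched as follows. This is a simplified illustration: the function name is hypothetical, the fragments are assumed to be already aligned, and a fixed minimum read count stands in for the statistical error models and chromatogram analysis mentioned in the text.

```python
from collections import Counter

def candidate_snps(aligned_reads, min_minor_reads=2):
    """Return {column: (consensus base, {minor base: count})} for candidate SNPs.

    aligned_reads: equal-length strings over A/C/G/T, with '-' where a read
    does not cover a column. A minor base must appear in at least
    min_minor_reads reads; this crude cutoff stands in for the statistical
    error models mentioned in the text.
    """
    if len({len(r) for r in aligned_reads}) > 1:
        raise ValueError("reads must be aligned to a common length")
    snps = {}
    for col, bases in enumerate(zip(*aligned_reads)):
        counts = Counter(b for b in bases if b != "-")
        if not counts:
            continue
        consensus, _ = counts.most_common(1)[0]
        minor = {b: n for b, n in counts.items()
                 if b != consensus and n >= min_minor_reads}
        if minor:
            snps[col] = (consensus, minor)
    return snps
```

A variant seen in only one read is treated here as a likely sequencing error and ignored; raising `min_minor_reads` makes the filter stricter at the cost of missing rare alleles.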

SNPs may be used to study the genetic basis of human disease. For example, at least 16 
common SNPs have been associated with non-insulin-dependent diabetes mellitus. SNPs are also 
useful for examining differences in disease outcomes in monogenic disorders, such as cystic fibrosis, 
sickle cell anemia, or chronic granulomatous disease. For example, variants in the mannose-binding 
lectin, MBL2, have been shown to be correlated with deleterious pulmonary outcomes in cystic 
fibrosis. SNPs also have utility in pharmacogenomics, the identification of genetic variants that 
influence a patient's response to a drug, such as life-threatening toxicity. For example, a variation in 
N-acetyl transferase is associated with a high incidence of peripheral neuropathy in response to the 
anti-tuberculosis drug isoniazid, while a variation in the core promoter of the ALOX5 gene results in 
diminished clinical response to treatment with an anti-asthma drug that targets the 5-lipoxygenase
pathway. Analysis of the distribution of SNPs in different populations is useful for investigating 
genetic drift, mutation, recombination, and selection, as well as for tracing the origins of populations 
and their migrations (Taylor, J.G. et al. (2001) Trends Mol. Med. 7:507-512; Kwok, P.-Y. and Z. Gu 
(1999) Mol. Med. Today 5:538-543; Nowotny, P. et al. (2001) Curr. Opin. Neurobiol. 11:637-641). 

Methods which may also be used to quantify the expression of PMMM include radiolabeling 
or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves (Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et al. (1993) 
Anal. Biochem. 212:229-236). The speed of quantitation of multiple samples may be accelerated by 
running the assay in a high-throughput format where the oligomer or polynucleotide of interest is 
presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 
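
By way of illustration only, interpolation of a result from a standard curve may be sketched as follows. The sketch assumes piecewise-linear interpolation between standards and a signal that increases monotonically with quantity; the cited methods may use a different curve model. The function name interpolate_quantity and the example values are hypothetical.

```python
from bisect import bisect_left


def interpolate_quantity(signal, standards):
    """Estimate a quantity from a measured signal using a standard curve.

    standards: list of (known_quantity, measured_signal) pairs; signals are
    assumed to increase monotonically with quantity (an assumption).
    """
    pts = sorted(standards, key=lambda p: p[1])
    signals = [s for _, s in pts]
    if not signals[0] <= signal <= signals[-1]:
        raise ValueError("signal lies outside the range of the standard curve")
    j = bisect_left(signals, signal)
    if signals[j] == signal:
        return pts[j][0]
    (q0, s0), (q1, s1) = pts[j - 1], pts[j]
    # Linear interpolation between the two flanking standards.
    return q0 + (q1 - q0) * (signal - s0) / (s1 - s0)


# Hypothetical dilution series: (quantity, absorbance)
standards = [(0.0, 0.05), (1.0, 0.25), (2.0, 0.45), (4.0, 0.85)]
print(interpolate_quantity(0.35, standards))
```

Signals outside the range of the standards raise an error rather than being extrapolated, which is a deliberately conservative choice for this sketch.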

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotides described herein may be used as elements on a microarray. The microarray can be 
used in transcript imaging techniques which monitor the relative expression levels of large numbers 
of genes simultaneously as described below. The microarray may also be used to identify genetic 
variants, mutations, and polymorphisms. This information may be used to determine gene function, 
to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 
progression/regression of disease as a function of gene expression, and to develop and monitor the 
activities of therapeutic agents in the treatment of disease. In particular, this information may be used 
to develop a pharmacogenomic profile of a patient in order to select the most appropriate and 
effective treatment regimen for that patient. For example, therapeutic agents which are highly 
effective and display the fewest side effects may be selected for a patient based on his/her 
pharmacogenomic profile. 

In another embodiment, PMMM, fragments of PMMM, or antibodies specific for PMMM 
may be used as elements on a microarray. The microarray may be used to monitor or measure 
protein-protein interactions, drug-target interactions, and gene expression profiles, as described 
above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of 
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and their relative abundance under given conditions and at 
a given time (Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent No. 5,840,484; 
hereby expressly incorporated by reference herein). Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or their complements to the totality of 
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 
hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated from tissues, cell lines, 
biopsies, or other biological samples. The transcript image may thus reflect gene expression in vivo, 
as in the case of a tissue or biopsy sample, or in vitro, as in the case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present 
invention may also be used in conjunction with in vitro model systems and preclinical evaluation of 
pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and 
toxicity (Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson 
(2000) Toxicol. Lett. 112-113:467-471). If a test compound has a signature similar to that of a 
compound with known toxicity, it is likely to share those toxic properties. These fingerprints or 
signatures are most useful and refined when they contain expression information from a large number 
of genes and gene families. Ideally, a genome-wide measurement of expression provides the highest 
quality signature. Even genes whose expression is not altered by any tested compounds are important 
as well, as the levels of expression of these genes are used to normalize the rest of the expression 
data. The normalization procedure is useful for comparison of expression data after treatment with 
different compounds. While the assignment of gene function to elements of a toxicant signature aids 
in interpretation of toxicity mechanisms, knowledge of gene function is not necessary for the 
statistical matching of signatures which leads to prediction of toxicity (see, for example, Press 
Release 00-02 from the National Institute of Environmental Health Sciences, released February 29, 
2000, available at http://www.niehs.nih.gov/oc/news/toxchip.htm). Therefore, it is important and 
desirable in toxicological screening using toxicant signatures to include all expressed gene sequences. 
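
By way of illustration only, normalization against unaltered genes and statistical matching of signatures may be sketched as follows. The sketch assumes that a signature is the set of log2 ratios of normalized treated to normalized untreated expression and that similarity is measured by Pearson correlation; no particular statistic is prescribed here. All function names, gene names, and values are hypothetical, and expression levels are assumed to be positive.

```python
import math


def normalize(levels, reference_genes):
    """Divide every level by the mean level of genes assumed to be unaltered."""
    scale = sum(levels[g] for g in reference_genes) / len(reference_genes)
    return {g: v / scale for g, v in levels.items()}


def signature(treated, untreated, reference_genes):
    """Log2 ratio of normalized treated to normalized untreated expression."""
    t = normalize(treated, reference_genes)
    u = normalize(untreated, reference_genes)
    return {g: math.log2(t[g] / u[g]) for g in t}


def pearson(sig_a, sig_b):
    """Pearson correlation over the genes shared by two signatures."""
    genes = sorted(set(sig_a) & set(sig_b))
    a = [sig_a[g] for g in genes]
    b = [sig_b[g] for g in genes]
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


# Hypothetical expression levels; "hk" stands for an unaltered reference gene.
untreated = {"hk": 100.0, "g1": 50.0, "g2": 50.0, "g3": 50.0}
known_toxicant = {"hk": 200.0, "g1": 400.0, "g2": 25.0, "g3": 100.0}
test_compound = {"hk": 100.0, "g1": 100.0, "g2": 25.0, "g3": 50.0}

known_sig = signature(known_toxicant, untreated, ["hk"])
test_sig = signature(test_compound, untreated, ["hk"])
print(pearson(known_sig, test_sig))
```

The example shows why the reference gene matters: the known-toxicant sample carries twice the total signal, and only after normalization do the two signatures show the same direction of change for each gene.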

In an embodiment, the toxicity of a test compound can be assessed by treating a biological 
sample containing nucleic acids with the test compound. Nucleic acids that are expressed in the 
treated biological sample are hybridized with one or more probes specific to the polynucleotides of 
the present invention, so that transcript levels corresponding to the polynucleotides of the present 
invention may be quantified. The transcript levels in the treated biological sample are compared with 
levels in an untreated biological sample. Differences in the transcript levels between the two samples 
are indicative of a toxic response caused by the test compound in the treated sample. 

Another embodiment relates to the use of the polypeptides disclosed herein to analyze the 
proteome of a tissue or cell type. The term proteome refers to the global pattern of protein expression 
in a particular tissue or cell type. Each protein component of a proteome can be subjected 
individually to further analysis. Proteome expression patterns, or profiles, are analyzed by 
quantifying the number of expressed proteins and their relative abundance under given conditions and 
at a given time. A profile of a cell's proteome may thus be generated by separating and analyzing the 
polypeptides of a particular tissue or cell type. In one embodiment, the separation is achieved using 
two-dimensional gel electrophoresis, in which proteins from a sample are separated by isoelectric 
focusing in the first dimension, and then according to molecular weight by sodium dodecyl sulfate 
slab gel electrophoresis in the second dimension (Steiner and Anderson, supra). The proteins are 
visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an 
agent such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot 
is generally proportional to the level of the protein in the sample. The optical densities of 
equivalently positioned protein spots from different samples, for example, from biological samples 
either treated or untreated with a test compound or therapeutic agent, are compared to identify any 
changes in protein spot density related to the treatment. The proteins in the spots are partially 
sequenced using, for example, standard methods employing chemical or enzymatic cleavage followed 
by mass spectrometry. The identity of the protein in a spot may be determined by comparing its 
partial sequence, preferably of at least 5 contiguous amino acid residues, to the polypeptide sequences 
of interest. In some cases, further sequence data may be obtained for definitive protein identification. 
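
By way of illustration only, comparison of a partial sequence to a polypeptide of interest may be sketched as follows. The sketch assumes exact matching of a window of at least 5 contiguous residues; a practical implementation might tolerate mass-equivalent or ambiguous residues. The function name matches_polypeptide and the example sequences are hypothetical.

```python
def matches_polypeptide(partial_seq, polypeptide, min_len=5):
    """Return True if at least min_len contiguous residues of partial_seq
    occur in polypeptide (exact matching; an assumption of this sketch)."""
    partial_seq = partial_seq.upper()
    polypeptide = polypeptide.upper()
    for i in range(len(partial_seq) - min_len + 1):
        if partial_seq[i:i + min_len] in polypeptide:
            return True
    return False


# Hypothetical polypeptide of interest and partial sequences from a gel spot.
polypeptide = "MKTAYIAKQRQISFVKSHFSRQ"
print(matches_polypeptide("XXAKQRQXX", polypeptide))  # shares AKQRQ
print(matches_polypeptide("AKQR", polypeptide))       # shorter than 5 residues
```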

A proteomic profile may also be generated using antibodies specific for PMMM to quantify 
the levels of PMMM expression. In one embodiment, the antibodies are used as elements on a 
microarray, and protein expression levels are quantified by exposing the microarray to the sample and 
detecting the levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem. 
270:103-111; Mendoza, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed 
by a variety of methods known in the art, for example, by reacting the proteins in the sample with a 
thiol- or amino-reactive fluorescent compound and detecting the amount of fluorescence bound at 
each array element. 

Toxicant signatures at the proteome level are also useful for toxicological screening, and 
should be analyzed in parallel with toxicant signatures at the transcript level. There is a poor 
correlation between transcript and protein abundances for some proteins in some tissues (Anderson, 
N.L. and J. Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be 
useful in the analysis of compounds which do not significantly affect the transcript image, but which 
alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to 
rapid degradation of mRNA, so proteomic profiling may be more reliable and informative in such 
cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated 
biological sample are separated so that the amount of each protein can be quantified. The amount of 
each protein is compared to the amount of the corresponding protein in an untreated biological 
sample. A difference in the amount of protein between the two samples is indicative of a toxic 
response to the test compound in the treated sample. Individual proteins are identified by sequencing 
the amino acid residues of the individual proteins and comparing these partial sequences to the 
polypeptides of the present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins from the biological sample are 
incubated with antibodies specific to the polypeptides of the present invention. The amount of 
protein recognized by the antibodies is quantified. The amount of protein in the treated biological 
sample is compared with the amount in an untreated biological sample. A difference in the amount of 
protein between the two samples is indicative of a toxic response to the test compound in the treated 
sample. 

Microarrays may be prepared, used, and analyzed using methods known in the art (Brennan, 
T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. USA 
93:10614-10619; Baldeschweiler et al. (1995) PCT application WO95/25116; Shalon, D. et al. (1995) 
PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150-2155; 
Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662). Various types of microarrays are well known 
and thoroughly described in Schena, M., ed. (1999) DNA Microarrays: A Practical Approach, Oxford 
University Press, London). 

In another embodiment of the invention, nucleic acid sequences encoding PMMM may be 
used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. 
Either coding or noncoding sequences may be used, and in some instances, noncoding sequences may 
be preferable over coding sequences. For example, conservation of a coding sequence among 
members of a multi-gene family may potentially cause undesired cross hybridization during 
chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific 
region of a chromosome, or to artificial chromosome constructions, e.g., human artificial 
chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes 
(BACs), bacterial P1 constructions, or single chromosome cDNA libraries (Harrington, J.J. et al. 
(1997) Nat. Genet. 15:345-355; Price, C.M. (1993) Blood Rev. 7:127-134; Trask, B.J. (1991) Trends 
Genet. 7:149-154). Once mapped, the nucleic acid sequences may be used to develop genetic linkage 
maps, for example, which correlate the inheritance of a disease state with the inheritance of a 
particular chromosome region or restriction fragment length polymorphism (RFLP) (Lander, E.S. and 
D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357). 

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 
map data (Heinz-Ulrich, et al. (1995) in Meyers, supra, pp. 965-968). Examples of genetic map data 
can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM) 
World Wide Web site. Correlation between the location of the gene encoding PMMM on a physical 
map and a specific disorder, or a predisposition to a specific disorder, may help define the region of 
DNA associated with that disorder and thus may further positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, 
may reveal associated markers even if the exact chromosomal locus is not known. This information 
is valuable to investigators searching for disease genes using positional cloning or other gene 
discovery techniques. Once the gene or genes responsible for a disease or syndrome have been 
crudely localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 
11q22-23, any sequences mapping to that area may represent associated or regulatory genes for 
further investigation (Gatti, R.A. et al. (1988) Nature 336:577-580). The nucleotide sequence of the 
instant invention may also be used to detect differences in the chromosomal location due to 
translocation, inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, PMMM, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes 
between PMMM and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest (Geysen, et al. (1984) PCT application 
WO84/03564). In this method, large numbers of different small test compounds are synthesized on a 
solid substrate. The test compounds are reacted with PMMM, or fragments thereof, and washed. 
Bound PMMM is then detected by methods well known in the art. Purified PMMM can also be 
coated directly onto plates for use in the aforementioned drug screening techniques. Alternatively, 
non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding PMMM specifically compete with a test compound for binding 
PMMM. In this manner, antibodies can be used to detect the presence of any peptide which shares 
one or more antigenic determinants with PMMM. 

In additional embodiments, the nucleotide sequences which encode PMMM may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following embodiments are, 
therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure 
in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, 
including U.S. Ser. No. 60/351,928, U.S. Ser. No. 60/359,903, and U.S. Ser. No. 60/366,837, are 
hereby expressly incorporated by reference. 

EXAMPLES 

I. Construction of cDNA Libraries 
Incyte cDNAs were derived from cDNA libraries described in the LIFESEQ GOLD database 
(Incyte Genomics, Palo Alto CA). Some tissues were homogenized and lysed in guanidinium 
isothiocyanate, while others were homogenized and lysed in phenol or in a suitable mixture of 
denaturants, such as TRIZOL (Invitrogen), a monophasic solution of phenol and guanidine 
isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or extracted with 
chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate and 
ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Invitrogen), using the recommended 
procedures or similar methods known in the art (Ausubel et al., supra, ch. 5). Reverse transcription 
was initiated using oligo d(T) or random primers. Synthetic oligonucleotide adapters were ligated to 
double stranded cDNA, and the cDNA was digested with the appropriate restriction enzyme or 
enzymes. For most libraries, the cDNA was size-selected (300-1000 bp) using SEPHACRYL S1000, 
SEPHAROSE CL2B, or SEPHAROSE CL4B column chromatography (Amersham Biosciences) or 
preparative agarose gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites 
of the polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid 
(Invitrogen, Carlsbad CA), PCDNA2.1 plasmid (Invitrogen), PBK-CMV plasmid (Stratagene), PCR2- 
TOPOTA plasmid (Invitrogen), PCMV-ICIS plasmid (Stratagene), pIGEN (Incyte Genomics, Palo 
Alto CA), pRARE (Incyte Genomics), or pINCY (Incyte Genomics), or derivatives thereof. 
Recombinant plasmids were transformed into competent E. coli cells including XL1-Blue, XL1- 
BlueMRF, or SOLR from Stratagene or DH5α, DH10B, or ElectroMAX DH10B from Invitrogen. 
II. Isolation of cDNA Clones 

Plasmids obtained as described in Example I were recovered from host cells by in vivo 
excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using 
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); 
an AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, 
QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96 
plasmid purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 
ml of distilled water and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 
384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 
using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence 
scanner (Labsystems Oy, Helsinki, Finland). 
III. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instrumentation 
such as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal 
cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the 
MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared 
using reagents provided by Amersham Biosciences or supplied in ABI sequencing kits such as the 
ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 
were carried out using the MEGABACE 1000 DNA sequencing system (Amersham Biosciences); the 
ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 
protocols and base calling software; or other sequence analysis systems known in the art. Reading 
frames within the cDNA sequences were identified using standard methods (Ausubel et al., supra, ch. 
7). Some of the cDNA sequences were selected for extension using the techniques disclosed in 
Example VIII. 

The polynucleotide sequences derived from Incyte cDNAs were validated by removing 
vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and 
programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. The 
Incyte cDNA sequences or translations thereof were then queried against a selection of public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and 
BLOCKS, PRINTS, DOMO, PRODOM; PROTEOME databases with sequences from Homo sapiens, 
Rattus norvegicus, Mus musculus, Caenorhabditis elegans, Saccharomyces cerevisiae, 
Schizosaccharomyces pombe, and Candida albicans (Incyte Genomics, Palo Alto CA); hidden 
Markov model (HMM)-based protein family databases such as PFAM, INCY, and TIGRFAM (Haft, 
D.H. et al. (2001) Nucleic Acids Res. 29:41-43); and HMM-based protein domain databases such as 
SMART (Schultz, J. et al. (1998) Proc. Natl. Acad. Sci. USA 95:5857-5864; Letunic, I. et al. (2002) 
Nucleic Acids Res. 30:242-244). (HMM is a probabilistic approach which analyzes consensus 
primary structures of gene families; see, for example, Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 
6:361-365.) The queries were performed using programs based on BLAST, FASTA, BLIMPS, and 
HMMER. The Incyte cDNA sequences were assembled to produce full length polynucleotide 
sequences. Alternatively, GenBank cDNAs, GenBank ESTs, stitched sequences, stretched sequences, 
or Genscan-predicted coding sequences (see Examples IV and V) were used to extend Incyte cDNA 
assemblages to full length. Assembly was performed using programs based on Phred, Phrap, and 
Consed, and cDNA assemblages were screened for open reading frames using programs based on 
GeneMark, BLAST, and FASTA. The full length polynucleotide sequences were translated to derive 
the corresponding full length polypeptide sequences. Alternatively, a polypeptide may begin at any 
of the methionine residues of the full length translated polypeptide. Full length polypeptide 
sequences were subsequently analyzed by querying against databases such as the GenBank protein 
databases (genpept), SwissProt, the PROTEOME databases, BLOCKS, PRINTS, DOMO, PRODOM, 
Prosite, hidden Markov model (HMM)-based protein family databases such as PFAM, INCY, and 
TIGRFAM; and HMM-based protein domain databases such as SMART. Full length polynucleotide 
sequences are also analyzed using MACDNASIS PRO software (MiraiBio, Alameda CA) and 
LASERGENE software (DNASTAR). Polynucleotide and polypeptide sequence alignments are 
generated using default parameters specified by the CLUSTAL algorithm as incorporated into the 
MEGALIGN multisequence alignment program (DNASTAR), which also calculates the percent 
identity between aligned sequences. 
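
By way of illustration only, translation of a full length polynucleotide sequence, and enumeration of the alternative polypeptides that begin at each methionine residue, may be sketched as follows. The sketch assumes the standard genetic code and translation from the first ATG to the first in-frame stop codon; it does not reproduce the open reading frame screening performed by the programs named above. The function names and the example sequence are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cdna):
    """Translate from the first ATG to the first in-frame stop codon."""
    cdna = cdna.upper()
    start = cdna.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(cdna) - 2, 3):
        aa = CODON_TABLE.get(cdna[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def alternative_polypeptides(full_length):
    """Polypeptides beginning at each methionine of the full length translation."""
    return [full_length[i:] for i, aa in enumerate(full_length) if aa == "M"]


full_length = translate("GGATGAAAATGGCTTGGTAAGG")  # hypothetical sequence
print(full_length)
print(alternative_polypeptides(full_length))
```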

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 
Incyte cDNA and full length sequences and provides applicable descriptions, references, and 
threshold parameters. The first column of Table 7 shows the tools, programs, and algorithms used, 
the second column provides brief descriptions thereof, the third column presents appropriate 
references, all of which are incorporated by reference herein in their entirety, and the fourth column 
presents, where applicable, the scores, probability values, and other parameters used to evaluate the 
strength of a match between two sequences (the higher the score or the lower the probability value, 
the greater the identity between two sequences). 

The programs described above for the assembly and analysis of full length polynucleotide 
and polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ 
ID NO:59-116. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization 
and amplification technologies are described in Table 4, column 2. 
IV. Identification and Editing of Coding Sequences from Genomic DNA 
Putative protein modification and maintenance molecules were initially identified by running 
the Genscan gene identification program against public genomic sequence databases (e.g., gbpri and 
gbhtg). Genscan is a general-purpose gene identification program which analyzes genomic DNA 
sequences from a variety of organisms (Burge, C. and S. Karlin (1997) J. Mol. Biol. 268:78-94; 
Burge, C. and S. Karlin (1998) Curr. Opin. Struct. Biol. 8:346-354). The program concatenates 
predicted exons to form an assembled cDNA sequence extending from a methionine to a stop codon. 
The output of Genscan is a FASTA database of polynucleotide and polypeptide sequences. The 
maximum range of sequence for Genscan to analyze at once was set to 30 kb. To determine which of 
these Genscan predicted cDNA sequences encode protein modification and maintenance molecules, 
the encoded polypeptides were analyzed by querying against PFAM models for protein modification 
and maintenance molecules. Potential protein modification and maintenance molecules were also 
identified by homology to Incyte cDNA sequences that had been annotated as protein modification 
and maintenance molecules. These selected Genscan-predicted sequences were then compared by 
BLAST analysis to the genpept and gbpri public databases. Where necessary, the Genscan-predicted 
sequences were then edited by comparison to the top BLAST hit from genpept to correct errors in the 
sequence predicted by Genscan, such as extra or omitted exons. BLAST analysis was also used to 
find any Incyte cDNA or public cDNA coverage of the Genscan-predicted sequences, thus providing 
evidence for transcription. When Incyte cDNA coverage was available, this information was used to 
correct or confirm the Genscan predicted sequence. Full length polynucleotide sequences were 
obtained by assembling Genscan-predicted coding sequences with Incyte cDNA sequences and/or 
public cDNA sequences using the assembly process described in Example III. Alternatively, full 
length polynucleotide sequences were derived entirely from edited or unedited Genscan-predicted 
coding sequences. 
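
By way of illustration only, concatenation of predicted exons into an assembled cDNA sequence extending from a methionine to a stop codon may be sketched as follows. The sketch takes exon coordinates as given and does not reproduce the Genscan prediction model; the coordinate convention (0-based, end-exclusive, forward strand), the function names, and the example sequence are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def assemble_predicted_cdna(genomic, exons):
    """Concatenate predicted exons in genomic order.

    exons: list of (start, end) pairs, 0-based and end-exclusive, on the
    forward strand (an assumed convention for this sketch).
    """
    return "".join(genomic[s:e] for s, e in sorted(exons)).upper()


def is_complete_cds(cdna):
    """Check that the sequence runs in frame from ATG to a stop codon."""
    cdna = cdna.upper()
    return (
        cdna.startswith("ATG")
        and len(cdna) % 3 == 0
        and cdna[-3:] in STOP_CODONS
    )


# Hypothetical genomic fragment: two exons (upper case) around one intron.
genomic = "CCATGGCTgtaagtttcagGAATAAcc"
exons = [(19, 25), (2, 8)]
cdna = assemble_predicted_cdna(genomic, exons)
print(cdna, is_complete_cds(cdna))
```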

V. Assembly of Genomic Sequence Data with cDNA Sequence Data 
"Stitched" Sequences 

Partial cDNA sequences were extended with exons predicted by the Genscan gene 
identification program described in Example IV. Partial cDNAs assembled as described in Example 
III were mapped to genomic DNA and parsed into clusters containing related cDNAs and Genscan 
exon predictions from one or more genomic sequences. Each cluster was analyzed using an algorithm 
based on graph theory and dynamic programming to integrate cDNA and genomic information, 
generating possible splice variants that were subsequently confirmed, edited, or extended to create a 
full length sequence. Sequence intervals in which the entire length of the interval was present on 
more than one sequence in the cluster were identified, and intervals thus identified were considered to 
be equivalent by transitivity. For example, if an interval was present on a cDNA and two genomic 
sequences, then all three intervals were considered to be equivalent. This process allows unrelated 
but consecutive genomic sequences to be brought together, bridged by cDNA sequence. Intervals 
thus identified were then "stitched" together by the stitching algorithm in the order that they appear 
along their parent sequences to generate the longest possible sequence, as well as sequence variants. 
Linkages between intervals which proceed along one type of parent sequence (cDNA to cDNA or 
genomic sequence to genomic sequence) were given preference over linkages which change parent 
type (cDNA to genomic sequence). The resultant stitched sequences were translated and compared 
by BLAST analysis to the genpept and gbpri public databases. Incorrect exons predicted by Genscan 
were corrected by comparison to the top BLAST hit from genpept. Sequences were further extended 
with additional cDNA sequences, or by inspection of genomic DNA, when necessary. 
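
By way of illustration only, the grouping of intervals that are "equivalent by transitivity" may be sketched as follows. The sketch uses a union-find (disjoint-set) structure, which is one conventional way to compute such equivalence classes; the data structures used by the stitching algorithm are not described here. The function name, the interval representation (sequence identifier, start, end), and the example data are hypothetical.

```python
def equivalence_classes(pairs):
    """Group items into classes given pairwise equivalences (union-find)."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving keeps trees shallow
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    classes = {}
    for item in list(parent):
        classes.setdefault(find(item), set()).add(item)
    return sorted(sorted(c) for c in classes.values())


# Hypothetical intervals: (sequence identifier, start, end).
pairs = [
    (("cDNA1", 0, 120), ("genomicA", 5000, 5120)),
    (("cDNA1", 0, 120), ("genomicB", 300, 420)),
    (("cDNA2", 10, 90), ("genomicB", 900, 980)),
]
for cls in equivalence_classes(pairs):
    print(cls)
```

In the example, the interval on cDNA1 matches intervals on two genomic sequences, so all three fall into one class even though the two genomic intervals were never compared directly.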
"Stretched" Sequences 

Partial DNA sequences were extended to full length with an algorithm based on BLAST 
analysis. First, partial cDNAs assembled as described in Example III were queried against public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases 
using the BLAST program. The nearest GenBank protein homolog was then compared by BLAST 
analysis to either Incyte cDNA sequences or GenScan exon predicted sequences described in 
Example IV. A chimeric protein was generated by using the resultant high-scoring segment pairs 
(HSPs) to map the translated sequences onto the GenBank protein homolog. Insertions or deletions 
may occur in the chimeric protein with respect to the original GenBank protein homolog. The 
GenBank protein homolog, the chimeric protein, or both were used as probes to search for 
homologous genomic sequences from the public human genome databases. Partial DNA sequences 
were therefore "stretched" or extended by the addition of homologous genomic sequences. The 
resultant stretched sequences were examined to determine whether they contained a complete gene. 
VI. Chromosomal Mapping of PMMM Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:59-116 were compared with 
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:59-116 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and Généthon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 
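
The assignment rule described above can be sketched as follows. This is only an illustrative sketch: the function name, the dictionary layout, and the sample identifiers are hypothetical and are not taken from the mapping software actually used.

```python
def assign_map_locations(clusters, known_locations):
    """Propagate known map locations to every member of a cluster.

    clusters: dict mapping a cluster id to a list of sequence ids (hypothetical layout).
    known_locations: dict mapping previously mapped sequence ids to a map location.
    Returns a dict mapping every sequence id to a sorted list of inherited locations.
    """
    assigned = {}
    for members in clusters.values():
        # Collect the locations of any previously mapped sequences in this cluster.
        locations = sorted({known_locations[m] for m in members if m in known_locations})
        for member in members:
            # Every member, mapped or not, receives the cluster's locations.
            assigned[member] = locations
    return assigned


if __name__ == "__main__":
    clusters = {"cluster_1": ["seq_a", "seq_b", "mapped_marker"], "cluster_2": ["seq_c"]}
    known = {"mapped_marker": "chr11:100.0-110.0 cM"}
    print(assign_map_locations(clusters, known))
```

A cluster with no previously mapped member simply yields an empty list for each of its sequences.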
Map locations are represented by ranges, or intervals, of human chromosomes. The map
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's
p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between
chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in
humans, although this can vary widely due to hot and cold spots of recombination.) The cM
distances are based on genetic markers mapped by Généthon which provide boundaries for radiation
hybrid markers whose sequences were included in each of the clusters. Human genome maps and
other resources available to the public, such as the NCBI "GeneMap'99" World Wide Web site
(http://www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified 
disease genes map within or in proximity to the intervals indicated above. 
Association of PMMM polynucleotides with Lung Cancer 

Heritable forms of lung carcinoma have not been reported and thus, identification of
relevant disease-associated genes through conventional linkage analysis is not possible.
However, several studies of sporadic non-small cell lung carcinoma (NSCLC) tumors have
reported loss of heterozygosity (LOH) in regions of chromosome 11, suggesting the presence of
one or more tumor suppressor genes (Bepler, G. and Garcia-Blanco, M.A. (1994) Proc. Natl.
Acad. Sci. USA 91:5513-7; Iizuka, M. (1995) Genes, Chromosomes & Cancer 13:40-46; Rasio,
D. (1995) Cancer Research 55:3988-91). In a study of 79 patients with lung cancer, Iizuka and
coworkers found that 11q14-11q24.2 was deleted in many of the lung tumors studied. Mapping
of this region with additional markers showed that the region of chromosome 11q bounded by
markers D11S939 and D11S938 was commonly deleted (Iizuka, et al., supra). In another study it
was shown that human A549 NSCLC cells transformed with a human-derived YAC clone
containing a region of chromosome 11q within the region bounded by D11S939 and D11S938
exhibited little or no increase in cell number (versus control cells whose number increased 5- to
10-fold in the same 5 day period). Further studies of these hybrid cells showed a decrease in
tumorigenicity and an increase in latency following injection into athymic, nude mice, as
compared with mice injected with control A549 cells. These studies suggest the presence of a
tumor suppressor gene within this region of chromosome 11q and support the association of LOH
in this region with NSCLC.

Restriction fragment length polymorphism (RFLP) markers shown to be near regions of
DNA known as sequence-tagged sites (STS), have been mapped to NT_Contigs generated by the
Human Genome Project using ePCR (Schuler, G.D. (1997) Genome Research 7:541-550, and
(1998) Trends Biotechnol. 16(11):456-459). Contigs containing regions of DNA with known
disease-associated markers are therefore used to identify PMMM sequences that map to
disease-associated regions of the genome.

Polynucleotides encoding PMMM were mapped to NT_Contigs. Contigs longer than
1 Mb were broken into subcontigs of 1 Mb length with overlapping sections of 100 kb. A
preliminary step used an algorithm, similar to MEGABLAST, to define the mRNA
sequence/masked genomic DNA contig pairings. The cDNA/genomic pairings identified by the first
algorithm were confirmed, and the PMMM polynucleotides mapped to DNA contigs, using SIM4
(Florea, L. et al. (1998) Genome Res. 8:967-74, version May 2000), which had been optimized for
high throughput processing and strand assignment confidence. The SIM4 output of the mRNA
sequence/genomic contig pairs was further processed to determine the correct location of the
PMMM polynucleotides on the genomic contig, as well as their strand identity.
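
The subcontig step described above can be illustrated with a short sketch. The function name and the treatment of the final, shorter window are assumptions made for illustration; the text specifies only the 1 Mb subcontig length and the 100 kb overlap.

```python
def split_contig(length, window=1_000_000, overlap=100_000):
    """Return (start, end) spans, 0-based and end-exclusive, covering a contig.

    Contigs no longer than `window` are returned whole. Longer contigs are cut
    into windows of `window` bases in which consecutive windows share `overlap`
    bases. The last window may be shorter (an assumption; not stated in the text).
    """
    if not 0 <= overlap < window:
        raise ValueError("overlap must be non-negative and smaller than window")
    if length <= window:
        return [(0, length)]
    step = window - overlap
    spans = []
    start = 0
    while True:
        end = min(start + window, length)
        spans.append((start, end))
        if end == length:
            break
        start += step
    return spans


if __name__ == "__main__":
    for start, end in split_contig(2_500_000):
        print(start, end)
```

Coordinates found on a subcontig can be converted back to contig coordinates by adding the span's start offset.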

SEQ ID NO:66 was mapped to NT_Contig GBI:NT_009151_019.8; SEQ ID NO:76-77,
SEQ ID NO:89-90, and SEQ ID NO:104 were mapped to NT_Contig GBI:NT_009151_022.8;
and SEQ ID NO:96 was mapped to NT_Contig GBI:NT_009151_020.8 from Genbank, version
128, covering a 5.5 Mb region of the genome that also contains lung cancer-associated genetic
markers D11S939 and D11S938. The maximum distance between SEQ ID NO:66, SEQ ID
NO:76-77, SEQ ID NO:89-90, SEQ ID NO:104, and SEQ ID NO:96 and markers D11S939 and
D11S938, therefore, is 5.5 Mb. Thus, SEQ ID NO:66, SEQ ID NO:76-77, SEQ ID NO:89-90,
SEQ ID NO:104, and SEQ ID NO:96 are in proximity with genetic markers shown to
consistently associate with lung cancer. Therefore, in various embodiments, SEQ ID NO:66,
SEQ ID NO:76-77, SEQ ID NO:89-90, SEQ ID NO:104, and SEQ ID NO:96 can be used for one
or more of the following: i) determination of LOH in persons with lung cancer in the lung cancer
disease region at 11q12-24.2, ii) diagnostic assays for lung cancer, and iii) developing
therapeutics and/or other treatments for lung cancer.
Association of PMMM polynucleotides with osteoarthritis 

Markers that map to regions associated with particular diseases can be used to develop 
diagnostic and therapeutic tools. Disease association of a chromosome locus is expressed as the lod 
(logarithm of odds) score. The lod score is the logarithm to base 10 of the odds in favor of linkage. 
Linkage is defined as the tendency of two genes located on the same chromosome to be inherited 
together through meiosis (Genetics in Medicine, Fifth Edition, (1991) Thompson, M.W. et al. W.B.
Saunders Co. Philadelphia). A logarithm of the odds ratio for linkage (lod) score of 2 indicates a 
probability of 1 in 100 that the marker was found solely by chance in affected individuals. In a study 
of 48 families affected by osteoarthritis (OA), Loughlin et al. (Rheumatology (2000) 39:377-381) 
identified D2S202/D2S72 and D2S117 as two genetic markers with a multiple lod of 2.19 for linkage
to OA of the hip. 
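
Because the lod score is a base-10 logarithm, converting between a lod score and the corresponding odds is a one-line calculation. The helper names below are hypothetical and are given only to make the arithmetic concrete.

```python
import math


def lod_score(odds_in_favor_of_linkage):
    """Lod score: logarithm to base 10 of the odds in favor of linkage."""
    return math.log10(odds_in_favor_of_linkage)


def odds_from_lod(lod):
    """Inverse conversion: odds in favor of linkage implied by a lod score."""
    return 10 ** lod


if __name__ == "__main__":
    print(odds_from_lod(2.0))   # odds implied by a lod score of 2
    print(odds_from_lod(2.19))  # odds implied by the lod score reported for OA of the hip
```

A lod score of 2 therefore corresponds to odds of 100 to 1, which matches the 1 in 100 figure given above.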

Restriction fragment length polymorphism (RFLP) markers shown to be near regions of DNA
known as sequence-tagged sites (STS), have been mapped to NT_Contigs generated by the Human
Genome Project using ePCR (Schuler, G.D. (1997) Genome Research 7:541-550, and (1998) Trends
Biotechnol. 16(11):456-9). Contigs containing regions of DNA with known disease-associated
markers are therefore used to identify PMMM sequences that map to disease-associated regions of the 
genome. 

Polynucleotides encoding PMMM were mapped to NT_Contigs. Contigs longer than 1 Mb
were broken into subcontigs of 1 Mb length with overlapping sections of 100 kb. A preliminary step
used an algorithm, similar to MEGABLAST, to define the mRNA sequence/masked genomic DNA
contig pairings. The cDNA/genomic pairings identified by the first algorithm were confirmed, and 
the PMMM polynucleotides mapped to DNA contigs, using SIM4 (Florea, L. et al. (1998) Genome 
Res. 8:967-74, version May 2000) which had been optimized for high throughput processing and 
strand assignment confidence. The SIM4 output of the mRNA sequence/genomic contig pairs was 
further processed to determine the correct location of the PMMM polynucleotides on the genomic 
contig, as well as their strand identity. 

SEQ ID NO:112-113 were mapped to NT_Contig GBI:NT_005229_002.8 from Genbank,
version 128, covering a 6.45 Mb region of the genome that also contains OA-associated genetic
markers D2S117 and D2S72. The maximum distance between SEQ ID NO:112-113 and markers
D2S117 and D2S72, therefore, is 6.45 Mb. Thus, SEQ ID NO:112-113 are in proximity with genetic
markers shown to consistently associate with OA. Therefore, in various embodiments, SEQ ID
NO:112-113 can be used for one or more of the following: i) linkage analysis of persons and/or
families to the OA disease region at 2q12-q22, ii) diagnostic assays for OA and interleukin expression
abnormalities, and iii) developing therapeutics and/or other treatments for OA.
VII. Analysis of Polynucleotide Expression

Northern analysis is a laboratory technique used to detect the presence of a transcript of a
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs
from a particular cell type or tissue have been bound (Sambrook and Russell, supra, ch. 7; Ausubel et
al., supra, ch. 4). 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is much
faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer
search can be modified to determine whether any particular match is categorized as exact or similar.
The basis of the search is the product score, which is defined as: 

$$\frac{\text{BLAST Score} \times \text{Percent Identity}}{5 \times \min\{\text{length(Seq. 1)},\ \text{length(Seq. 2)}\}}$$

The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 
product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 
calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 
(HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by 
gaps). If there is more than one HSP, then the pair with the highest BLAST score is used to calculate
the product score. The product score represents a balance between fractional overlap and quality in a 
BLAST alignment. For example, a product score of 100 is produced only for 100% identity over the 
entire length of the shorter of the two sequences being compared. A product score of 70 is produced 
either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the 
other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% 
identity and 100% overlap. 
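
The product score calculation can be reproduced with a few lines of code. The sketch below assumes an ungapped HSP described only by its counts of matching and mismatching bases; the function names are hypothetical, and clamping negative values to zero is an assumption made so that the result stays within the stated 0 to 100 range.

```python
def blast_score(matches, mismatches, match_reward=5, mismatch_penalty=4):
    """BLAST score as defined in the text: +5 per matching base, -4 per mismatch."""
    return match_reward * matches - mismatch_penalty * mismatches


def product_score(matches, mismatches, len_seq1, len_seq2):
    """Product score = BLAST score x percent identity / (5 x shorter sequence length)."""
    aligned = matches + mismatches
    if aligned == 0:
        return 0.0
    percent_identity = 100.0 * matches / aligned
    score = blast_score(matches, mismatches)
    value = score * percent_identity / (5 * min(len_seq1, len_seq2))
    return max(0.0, value)  # assumption: negative scores are reported as 0


if __name__ == "__main__":
    shorter = 100
    print(product_score(100, 0, shorter, 250))  # 100% identity, 100% overlap
    print(product_score(70, 0, shorter, 250))   # 100% identity, 70% overlap
    print(product_score(88, 12, shorter, 250))  # 88% identity, 100% overlap
    print(product_score(79, 21, shorter, 250))  # 79% identity, 100% overlap
```

With these inputs the last two cases evaluate to roughly 69 and 49, consistent with the approximate product scores of 70 and 50 quoted above.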

Alternatively, polynucleotides encoding PMMM are analyzed with respect to the tissue
sources from which they were derived. For example, some full length sequences are assembled, at
least in part, with overlapping Incyte cDNA sequences (see Example III). Each cDNA sequence is
derived from a cDNA library constructed from a human tissue. Each human tissue is classified into
one of the following organ/tissue categories: cardiovascular system; connective tissue; digestive
system; embryonic structures; endocrine system; exocrine glands; genitalia, female; genitalia, male;
germ cells; hemic and immune system; liver; musculoskeletal system; nervous system; pancreas;
respiratory system; sense organs; skin; stomatognathic system; unclassified/mixed; or urinary tract.
The number of libraries in each category is counted and divided by the total number of libraries
across all categories. Similarly, each human tissue is classified into one of the following
disease/condition categories: cancer, cell line, developmental, inflammation, neurological, trauma,
cardiovascular, pooled, and other, and the number of libraries in each category is counted and divided
by the total number of libraries across all categories. The resulting percentages reflect the tissue- and
disease-specific expression of cDNA encoding PMMM. cDNA sequences and cDNA library/tissue
information are found in the LIFESEQ GOLD database (Incyte Genomics, Palo Alto CA).
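
The library-counting step can be sketched as below. The input is assumed to be a list holding one category label per cDNA library in which a sequence was found; the function name and the sample labels are illustrative only and do not reflect the LIFESEQ GOLD data model.

```python
from collections import Counter


def category_percentages(library_categories):
    """Percentage of libraries per category.

    library_categories: one category label per library (hypothetical input format).
    Each count is divided by the total number of libraries across all categories.
    """
    counts = Counter(library_categories)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {category: 100.0 * n / total for category, n in counts.items()}


if __name__ == "__main__":
    libraries = ["liver", "liver", "pancreas", "nervous system"]
    print(category_percentages(libraries))
```

The same function applies unchanged to the disease/condition categories, since only the labels differ.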
VIII. Extension of PMMM Encoding Polynucleotides

Full length polynucleotides are produced by extension of an appropriate fragment of the full 
length molecule using oligonucleotide primers designed from this fragment. One primer was 
synthesized to initiate 5' extension of the known fragment, and the other primer was synthesized to
initiate 3' extension of the known fragment. The initial primers were designed using OLIGO 4.06 
software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides in 
length, to have a GC content of about 50% or more, and to anneal to the target sequence at 
temperatures of about 68°C to about 72°C. Any stretch of nucleotides which would result in hairpin 
structures and primer-primer dimerizations was avoided. 
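
Two of the primer criteria above, length and GC content, are simple to check programmatically. The sketch below is not the OLIGO 4.06 procedure; the function names are hypothetical, the thresholds treat the "about" values as hard limits, and annealing temperature and hairpin or dimer screening are deliberately left out.

```python
def gc_fraction(primer):
    """Fraction of G and C bases in a primer sequence."""
    seq = primer.upper()
    if not seq:
        raise ValueError("primer sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


def meets_basic_criteria(primer, min_len=22, max_len=30, min_gc=0.50):
    """Check length (about 22 to 30 nt) and GC content (about 50% or more)."""
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


if __name__ == "__main__":
    candidate = "ATGCGCGATCGTAGCCTAGGCATC"  # made-up 24-mer used only as an example
    print(len(candidate), round(gc_fraction(candidate), 3), meets_basic_criteria(candidate))
```

Candidates that pass this screen would still need to be checked for annealing temperature, hairpins, and primer-primer dimerization with a dedicated design program.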

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg2+, (NH4)2SO4,
and 2-mercaptoethanol, Taq DNA polymerase (Amersham Biosciences), ELONGASE enzyme
(Invitrogen), and Pfu DNA polymerase (Stratagene), with the following parameters for primer pair
PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 2
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2:
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times;
Step 6: 68°C, 5 min; Step 7: storage at 4°C.
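
The cycling parameters above can be written down as data and summed to estimate the programmed hold time. In this sketch the dictionary layout and names are hypothetical, Steps 2 to 4 are assumed to run 20 times in total, and ramp times and the final 4°C hold are ignored.

```python
# Each step is (temperature in degrees C, hold time in seconds).
PCIA_PCIB_PROGRAM = {
    "initial": (94, 180),
    "cycle": [(94, 15), (60, 60), (68, 120)],
    "cycles": 20,  # assumption: Steps 2-4 are run 20 times in total
    "final": (68, 300),
}

# The T7/SK+ program differs only in the annealing temperature (57 instead of 60).
T7_SK_PROGRAM = dict(PCIA_PCIB_PROGRAM, cycle=[(94, 15), (57, 60), (68, 120)])


def programmed_hold_seconds(program):
    """Sum of programmed hold times, excluding ramping and the 4 degree C storage step."""
    per_cycle = sum(seconds for _, seconds in program["cycle"])
    return program["initial"][1] + program["cycles"] * per_cycle + program["final"][1]


if __name__ == "__main__":
    total = programmed_hold_seconds(PCIA_PCIB_PROGRAM)
    print(total, "seconds, about", total // 60, "minutes")
```

Under these assumptions both programs hold for the same total time, because they differ only in annealing temperature.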

The concentration of DNA in each well was determined by dispensing 100 µl PICOGREEN
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in 1X TE
and 0.5 µl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar,
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the
concentration of DNA. A 5 µl to 10 µl aliquot of the reaction mixture was analyzed by
electrophoresis on a 1% agarose gel to determine which reactions were successful in extending the
sequence.

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Biosciences). For shotgun 
sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose gels, 
fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were 
religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham
Biosciences), treated with Pfu DNA polymerase (Stratagene) to fill in restriction site overhangs, and
transfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing 
media, and individual colonies were picked and cultured overnight at 37 °C in 384-well plates in 
LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase
(Amersham Biosciences) and Pfu DNA polymerase (Stratagene) with the following parameters: Step
1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; Step 5: steps 2, 3,
and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by
PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA recoveries
were reamplified using the same conditions as described above. Samples were diluted with 20%
dimethylsulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing primers
and the DYENAMIC DIRECT kit (Amersham Biosciences) or the ABI PRISM BIGDYE Terminator 
cycle sequencing ready reaction kit (Applied Biosystems). 

In like manner, full length polynucleotides are verified using the above procedure or are used 
to obtain 5' regulatory sequences using the above procedure along with oligonucleotides designed for
such extension, and an appropriate genomic library. 

IX. Identification of Single Nucleotide Polymorphisms in PMMM Encoding Polynucleotides 

Common DNA sequence variants known as single nucleotide polymorphisms (SNPs) were
identified in SEQ ID NO:59-116 using the LIFESEQ database (Incyte Genomics). Sequences from
the same gene were clustered together and assembled as described in Example III, allowing the
identification of all sequence variants in the gene. An algorithm consisting of a series of filters was
used to distinguish SNPs from other sequence variants. Preliminary filters removed the majority of
basecall errors by requiring a minimum Phred quality score of 15, and removed sequence alignment
errors and errors resulting from improper trimming of vector sequences, chimeras, and splice
variants. An automated procedure of advanced chromosome analysis analyzed the original
chromatogram files in the vicinity of the putative SNP. Clone error filters used statistically generated
algorithms to identify errors introduced during laboratory processing, such as those caused by reverse
transcriptase, polymerase, or somatic mutation. Clustering error filters used statistically generated
algorithms to identify errors resulting from clustering of close homologs or pseudogenes, or due to
contamination by non-human sequences. A final set of filters removed duplicates and SNPs found in
immunoglobulins or T-cell receptors.
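
The first of these filters, the minimum Phred quality score, can be illustrated briefly. A Phred score Q corresponds to an estimated basecall error probability of 10^(-Q/10), so a threshold of 15 admits bases with an error probability of roughly 3%. The function names and the list-based input below are assumptions made for illustration and do not describe the filtering software actually used.

```python
def phred_to_error_probability(quality):
    """Estimated basecall error probability for a Phred quality score."""
    return 10 ** (-quality / 10)


def passes_quality_filter(base_qualities, position, min_quality=15):
    """True if the basecall at `position` (0-based) meets the minimum Phred score."""
    return base_qualities[position] >= min_quality


if __name__ == "__main__":
    read_qualities = [32, 28, 14, 40, 15]  # made-up per-base Phred scores
    print(round(phred_to_error_probability(15), 4))
    print([passes_quality_filter(read_qualities, i) for i in range(len(read_qualities))])
```

The later filters (clone error, clustering error, and duplicate removal) depend on data not described in enough detail here to sketch.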

Certain SNPs were selected for further characterization by mass spectrometry using the high
throughput MASSARRAY system (Sequenom, Inc.) to analyze allele frequencies at the SNP sites in
four different human populations. The Caucasian population comprised 92 individuals (46 male, 46
female), including 83 from Utah, four French, three Venezuelan, and two Amish individuals. The
African population comprised 194 individuals (97 male, 97 female), all African Americans. The 
Hispanic population comprised 324 individuals (162 male, 162 female), all Mexican Hispanic. The 
Asian population comprised 126 individuals (64 male, 62 female) with a reported parental breakdown 
of 43% Chinese, 31% Japanese, 13% Korean, 5% Vietnamese, and 8% other Asian. Allele 
frequencies were first analyzed in the Caucasian population; in some cases those SNPs which showed 
no allelic variance in this population were not further tested in the other three populations. 
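
Allele frequencies of the kind analyzed above can be computed from diploid genotype calls as sketched below. The genotype representation (a pair of allele letters per individual) and the function names are assumptions made for illustration; they do not describe the MASSARRAY output format.

```python
from collections import Counter


def allele_frequencies(genotypes):
    """Allele frequencies from diploid genotypes such as ("A", "G")."""
    counts = Counter(allele for genotype in genotypes for allele in genotype)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {allele: n / total for allele, n in counts.items()}


def shows_allelic_variance(genotypes):
    """True if more than one allele is observed in the population sample."""
    return len(allele_frequencies(genotypes)) > 1


if __name__ == "__main__":
    sample = [("A", "A"), ("A", "G"), ("G", "G"), ("A", "A")]  # made-up genotypes
    print(allele_frequencies(sample))
    print(shows_allelic_variance(sample))
```

A SNP for which `shows_allelic_variance` is false in the first population would be a candidate for exclusion from testing in the remaining populations.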

X. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:59-116 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 µCi of
[γ-32P] adenosine triphosphate (Amersham Biosciences), and T4 polynucleotide kinase (DuPont NEN,
Boston MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25
superfine size exclusion dextran bead column (Amersham Biosciences). An aliquot containing 10^
counts per minute of the labeled probe is used in a typical membrane-based hybridization analysis of
human genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I,
Xba I, or Pvu II (DuPont NEN).

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate.
Hybridization patterns are visualized using autoradiography or an alternative imaging means and
compared. 

XI. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing; see, e.g., Baldeschweiler et al., supra),
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the 
aforementioned technologies should be uniform and solid with a non-porous surface (Schena, M., ed. 
(1999) DNA Microarrays: A Practical Approach, Oxford University Press, London). Suggested
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a 
procedure analogous to a dot or slot blot may also be used to arrange and link elements to the surface 
of a substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may
be produced using available methods and machines well known to those of ordinary skill in the art 
and may contain any appropriate number of elements (Schena, M. et al. (1995) Science 270:467-470; 
Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. 
Biotechnol. 16:27-31). 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software (DNASTAR). The 
array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in 
the biological sample are conjugated to a fluorescent label or other molecular tag for ease of 
detection. After hybridization, nonhybridized nucleotides from the biological sample are removed, 
and a fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
desorption and mass spectrometry may be used for detection of hybridization. The degree of
complementarity and the relative abundance of each polynucleotide which hybridizes to an element 
on the microarray may be assessed. In one embodiment, microarray preparation and usage is 
described in detail below. 
Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and
poly(A)+ RNA is purified using the oligo-(dT) cellulose method. Each poly(A)+ RNA sample is
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/µl oligo-(dT) primer (21mer), 1X
first strand buffer, 0.03 units/µl RNase inhibitor, 500 µM dATP, 500 µM dGTP, 500 µM dTTP, 40
µM dCTP, 40 µM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Biosciences). The reverse
transcription reaction is performed in a 25 µl volume containing 200 ng poly(A)+ RNA with
GEMBRIGHT kits (Incyte Genomics). Specific control poly(A)+ RNAs are synthesized by in vitro
transcription from non-coding yeast genomic DNA. After incubation at 37°C for 2 hr, each reaction
sample (one with Cy3 and another with Cy5 labeling) is treated with 2.5 µl of 0.5M sodium
hydroxide and incubated for 20 minutes at 85°C to stop the reaction and degrade the RNA.
Samples are purified using two successive CHROMA SPIN 30 gel filtration spin columns (Clontech,
Palo Alto CA) and after combining, both reaction samples are ethanol precipitated using 1 µl of
glycogen (1 mg/ml), 60 µl sodium acetate, and 300 µl of 100% ethanol. The sample is then dried to
completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and resuspended in 14 µl 5X
SSC/0.2% SDS.
Microarray Preparation 
Sequences of the present invention are used to generate array elements. Each array element
is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification
uses primers complementary to the vector sequences flanking the cDNA insert. Array elements are
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5
µg. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Biosciences).

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water,
and coated with 0.05% aminopropyl silane (Sigma-Aldrich, St. Louis MO) in 95% ethanol. Coated
slides are cured in a 110°C oven.

Array elements are applied to the coated glass substrate using a procedure described in U.S. 
Patent No. 5,807,522, incorporated herein by reference. 1 µl of the array element DNA, at an average
concentration of 100 ng/µl, is loaded into the open capillary printing element by a high-speed robotic
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60° C followed by washes in 
0.2% SDS and distilled water as before. 
Hybridization 

Hybridization reactions contain 9 µl of sample mixture consisting of 0.2 µg each of Cy3 and
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample
mixture is heated to 65°C for 5 minutes and is aliquoted onto the microarray surface and covered
with a 1.8 cm2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just
slightly larger than a microscope slide. The chamber is kept at 100% humidity internally by the
addition of 140 µl of 5X SSC in a corner of the chamber. The chamber containing the arrays is
incubated for about 6.5 hours at 60°C. The arrays are washed for 10 min at 45°C in a first wash
buffer (1X SSC, 0.1% SDS), three times for 10 minutes each at 45°C in a second wash buffer (0.1X
SSC), and dried.
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially.
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. 
Appropriate filters positioned between the array and the photomultiplier tubes are used to filter the 
signals. The emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. 
Each array is typically scanned twice, one scan per fluorophore using the appropriate filters at the 
laser source, although the apparatus is capable of recording the spectra from both fluorophores 
simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that 
location to be correlated with a weight ratio of hybridizing species of 1:100,000. When two samples 
from different sources (e.g., representing test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that are
differentially expressed, the calibration is done by labeling samples of the calibrating cDNA with the
two fluorophores and adding identical amounts of each to the hybridization mixture.

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC
computer. The digitized data are displayed as an image where the signal intensity is mapped using a
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high
signal). The data are also analyzed quantitatively. Where two different fluorophores are excited and
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping
emission spectra) between the fluorophores using each fluorophore's emission spectrum.

A grid is superimposed over the fluorescence signal image such that the signal from each 
spot is centered in each element of the grid. The fluorescence signal within each element is then 
integrated to obtain a numerical value corresponding to the average intensity of the signal. The 
software used for signal analysis is the GEMTOOLS gene expression analysis program (Incyte 
Genomics). Array elements that exhibit at least about a two-fold change in expression, a signal-to- 
background ratio of at least about 2.5, and an element spot size of at least about 40%, are considered 
to be differentially expressed. 
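
The three selection criteria above can be combined into a single test, sketched below. This is not the GEMTOOLS implementation; the function and parameter names are hypothetical, the "about" thresholds are treated as hard cut-offs, and the expression ratio is assumed to be the test signal divided by the control signal.

```python
def is_differentially_expressed(expression_ratio, signal_to_background, spot_size_fraction,
                                min_fold=2.0, min_signal_to_background=2.5, min_spot_size=0.40):
    """Apply the fold-change, signal-to-background, and spot-size criteria.

    expression_ratio: test signal / control signal (assumed definition), must be > 0.
    spot_size_fraction: element spot size expressed as a fraction (0.40 means 40%).
    """
    if expression_ratio <= 0:
        raise ValueError("expression_ratio must be positive")
    # A two-fold change counts in either direction (increase or decrease).
    fold_change = max(expression_ratio, 1.0 / expression_ratio)
    return (fold_change >= min_fold
            and signal_to_background >= min_signal_to_background
            and spot_size_fraction >= min_spot_size)


if __name__ == "__main__":
    print(is_differentially_expressed(2.4, 3.1, 0.55))  # increased expression
    print(is_differentially_expressed(0.4, 3.1, 0.55))  # decreased expression
    print(is_differentially_expressed(1.5, 3.1, 0.55))  # change too small
```

Taking the larger of the ratio and its reciprocal lets one threshold cover both increased and decreased expression.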
Expression

For example, SEQ ID NO:62 showed differential expression associated with lung cancer, as 
determined by microarray analysis. Pair comparisons of lung tumor tissue and microscopically- 
normal tissue from the same donor were made. The expression of SEQ ID NO:62 was increased by at 
least two-fold in lung squamous cell carcinoma tissue from a 66 year-old male as compared to normal 
lung tissue from the same donor (Roy Castle International Centre for Lung Cancer Research,
Liverpool, UK). Therefore, SEQ ID NO:13 is useful in diagnostic and disease staging assays for lung
cancer and as a potential biological marker or therapeutic agent in the treatment of lung cancer.

In an alternative example, SEQ ID NO:67 showed differential expression in certain prostate 
carcinoma cell lines versus normal prostate epithelial cells as determined by microarray analysis. 
Three prostate carcinoma cell lines, DU 145, LNCaP, and PC-3 were included in the experiments. 
DU 145 was isolated from a metastatic site in the brain of a 69 year old male with widespread 
metastatic prostate carcinoma. DU 145 has no detectable sensitivity to hormones; forms colonies in 
semi-solid medium; is only weakly positive for acid phosphatase; and cells are negative for prostate
specific antigen (PSA). LNCaP is a prostate carcinoma cell line isolated from a lymph node biopsy 
of a 50 year old male with metastatic prostate carcinoma. LNCaP expresses PSA, produces prostate 
acid phosphatase, and expresses androgen receptors. PC-3, a prostate adenocarcinoma cell line, was 
isolated from a metastatic site in the bone of a 62 year old male with grade IV prostate 
adenocarcinoma. The normal epithelial cell line, PrEC, is a primary prostate epithelial cell line
isolated from a normal donor. In one experiment, the expression of cDNAs from the prostate
carcinoma cell lines representing various stages of prostate tumor progression was compared with
that of the normal prostate epithelial cells grown under the same culture conditions (in the
absence of growth factors and hormones). The
experiment showed that the expression of SEQ ID NO:67 was increased by at least two fold in 
DU145 prostate carcinoma lines relative to PrECs. Therefore, SEQ ID NO:67 is useful as a 
diagnostic marker or as a potential therapeutic target for certain prostate cancers. 

In an alternative example, SEQ ID NO:69 also showed differential expression in certain 
prostate carcinoma cell lines versus normal prostate epithelial cells as determined by microarray 
analysis. The prostate carcinoma cell lines include CA-HPV-10, DU 145, LNCaP, and PC-3. 
CA-HPV-10 was derived from cells from a 63 year old male with prostate adenocarcinoma and was 
transformed by transfection with HPV18 DNA. DU 145 was isolated from a metastatic site in the 
brain of a 69 year old male with widespread metastatic prostate carcinoma. DU 145 has no detectable 
sensitivity to hormones; forms colonies in semi-solid medium; is only weakly positive for acid 
phosphatase; and cells are negative for prostate specific antigen (PSA). LNCaP is a prostate 
carcinoma cell line isolated from a lymph node biopsy of a 50 year old male with metastatic prostate 
carcinoma. LNCaP expresses PSA, produces prostate acid phosphatase, and expresses androgen 
receptors. PC-3, a prostate adenocarcinoma cell line, was isolated from a metastatic site in the bone 
of a 62 year old male with grade IV prostate adenocarcinoma. The normal epithelial cell line, PZ- 
HPV-7 was derived from epithelial cells cultured from normal tissue from the peripheral zone of the 
prostate. The PZ-HPV-7 cells were transformed by transfection with HPV 18. The microarray 
experiments showed that the expression of SEQ ID NO:69 was decreased by at least two fold in CA- 
HPV-10 prostate carcinoma line relative to cells from the normal prostate epithelial cell line, PZ- 
HPV-7. Therefore, SEQ ID NO: 69 is useful as a diagnostic marker or as a potential therapeutic target 
for certain prostate cancers. 

In an alternative example, the expression of SEQ ID NO:69 was increased by at least 2-fold 
in ovarian tumor tissue when matched with normal tissue from the same donor, a 79-year-old female 
donor with ovarian adenocarcinoma. Matched normal and tumorigenic ovarian tissue samples are 
provided by the Huntsman Cancer Institute (Salt Lake City, UT). Therefore, SEQ ID NO:69 is 
useful in diagnostic assays and disease staging for ovarian cancer and as a potential biological marker 
and therapeutic agent in the treatment of ovarian cancer. 

In an alternative example, SEQ ID NO:70 showed differential expression in certain breast 
carcinoma cell lines versus a non-malignant mammary epithelial cell line as determined by 
microarray analysis. The non-malignant mammary epithelial cell line, MCF10A, was isolated from a 
36-year-old female with fibrocystic breast disease. The breast carcinoma cell lines include BT20, a 
breast carcinoma cell line derived in vitro from cells emigrating out of thin slices of a tumor mass 
isolated from a 74-year-old female; MCF7, a breast adenocarcinoma cell line derived from the pleural 
effusion of a 69-year-old female; MDA-MB-231, a metastatic breast tumor cell line derived from the 
pleural effusion of a 51-year-old female with metastatic breast carcinoma; SkBR3, a breast 
adenocarcinoma cell line isolated from a malignant pleural effusion of a 43-year-old female; and 
T47D, a breast carcinoma cell line derived from a pleural effusion from a 54-year-old female with an 
infiltrating ductal carcinoma of the breast. All cell cultures were propagated in a chemically-defined, 
serum-free medium, H14, according to the supplier's recommendations and grown to 70-80% 
confluence prior to RNA isolation. The microarray experiments showed that the expression of SEQ 
ID NO:70 was increased by at least two fold in BT20 breast carcinoma line relative to non-malignant 
mammary epithelial cells. Therefore, SEQ ID NO:70 is useful in diagnostic assays for and 
monitoring treatment of certain breast cancers. 

In an alternative example, SEQ ID NO:94, SEQ ID NO:101, and SEQ ID NO:105 showed 
differential expression associated with breast cancer, as determined by microarray analysis. The gene 
expression profile of a nonmalignant mammary epithelial cell line was compared to the gene 
expression profiles of breast carcinoma lines at different stages of tumor progression. Cell lines 
compared included: a) BT-20, a breast carcinoma cell line derived in vitro from the cells emigrating 
out of thin slices of tumor mass isolated from a 74-year-old female, b) BT-474, a breast ductal 
carcinoma cell line that was isolated from a solid, invasive ductal carcinoma of the breast obtained 
from a 60-year-old woman, c) BT-483, a breast ductal carcinoma cell line that was isolated from a 
papillary invasive ductal tumor obtained from a 23-year-old normal, menstruating, parous female with 
a family history of breast cancer, d) Hs578T, a breast ductal carcinoma cell line isolated from a 
74-year-old female with breast carcinoma, e) MCF7, a breast adenocarcinoma cell line 
isolated from the pleural effusion of a 69-year-old female, f) MCF-10A, a breast mammary gland 
(luminal ductal characteristics) cell line isolated from a 36-year-old woman with fibrocystic breast 
disease, g) MDA-mb-435S, a spindle-shaped strain that evolved from the parent line (435) isolated by 
R. Cailleau from pleural effusion of a 31-year-old female with metastatic, ductal adenocarcinoma of 
the breast, h) Sk-BR-3, a breast adenocarcinoma cell line isolated from a malignant pleural effusion 
of a 43-year-old female, i) T-47D, a breast carcinoma cell line isolated from a pleural effusion 
obtained from a 54-year-old female with an infiltrating ductal carcinoma of the breast and j) HMEC, a 
primary breast epithelial cell line isolated from a normal donor. The expression of SEQ ID NO:94 
was decreased by more than two-fold in carcinoma cell lines BT20, Sk-BR-3 and T-47D as compared 
to both HMEC cells and MCF10A cells. SEQ ID NO:101 expression was reduced by at least two- 
fold in BT20 and MCF7 cells as compared to HMEC cells. SEQ ID NO:105 expression was 
downregulated by at least two-fold in the carcinoma cell line BT20 as compared to the HMEC cell 
line. Therefore, SEQ ID NO:94, SEQ ID NO:101, and SEQ ID NO:105 are useful in monitoring 
treatment of, and diagnostic assays for breast cancer. 

In an alternative example, SEQ ID NO:83, SEQ ID NO:101, SEQ ID NO:103, and SEQ ID 
NO: 105 showed differential expression associated with lung cancer, as determined by microarray 
analysis. Expression was compared in matched samples of normal and lung tumor tissue from 
individual donors. Tissue samples were provided by the Roy Castle International Centre for Lung 
Cancer Research. The expression of SEQ ID NO: 83 was decreased by at least two fold in squamous 
cell carcinoma tissue from a 73-year-old male donor and in lung adenocarcinoma tissue from a 71- 
year-old female donor, as compared to normal tissue from each of the respective donors. The 
expression of SEQ ID NO: 101, SEQ ID NO: 103, and SEQ ID NO: 105 was upregulated by at least 
two-fold in three out of four lung tumor tissue samples as compared to normal tissue from each of the 
four donors. Therefore, SEQ ID NO:83, SEQ ID NO:101, SEQ ID NO:103, and SEQ ID NO:105 are 
useful in monitoring treatment of, and diagnostic assays for lung cancer. 

In an alternative example, SEQ ID NO: 105 showed differential expression associated with 
colon cancer, as determined by microarray analysis. Matched normal and tumor samples from a 58- 
year-old female diagnosed with mucinous adenocarcinoma (Huntsman Cancer Institute, Salt Lake 
City, UT) were compared by competitive hybridization. SEQ ID NO: 105 expression was upregulated 
by at least two-fold in the tumor tissue sample as compared to the normal tissue sample. Therefore, 
SEQ ID NO: 105 is useful in monitoring treatment of, and diagnostic assays for colon cancer. 

In an alternative example, SEQ ID NO: 106 showed differential expression associated with 
prostate cancer, as determined by microarray analysis. Primary prostate epithelial cells were 
compared with prostate carcinomas representative of the different stages of tumor progression. Cell 
lines compared included: a) PrEC, a primary prostate epithelial cell line isolated from a normal donor, 
b) DU145, a prostate carcinoma cell line isolated from a metastatic site in the brain of a 69-year-old 
male with widespread metastatic prostate carcinoma, c) LNCaP, a prostate carcinoma cell line 
isolated from a lymph node biopsy of a 50-year-old male with metastatic prostate carcinoma, and 
d) PC-3, a prostate adenocarcinoma cell line isolated from a metastatic site in the bone of a 
62-year-old male with grade IV prostate adenocarcinoma. Cells grown under restrictive conditions were 
compared to normal PrECs grown under restrictive conditions. Cells were grown in basal media in 
the absence of growth factors and hormones. The expression of SEQ ID NO: 106 was increased by at 
least two-fold in DU145 cells as compared to PrEC cells. Therefore, SEQ ID NO: 106 is useful in 
monitoring treatment of, and diagnostic assays for prostate cancer. 

In an alternative example, SEQ ID NO: 106 showed differential expression associated with 
Tangier disease (TD), as determined by microarray analysis. The expression of SEQ ID NO: 106 was 
increased at least eight-fold in Tangier disease-derived fibroblasts compared to normal fibroblasts. In 
addition, both types of cells were cultured in the presence of cholesterol and compared with the same 
cell type cultured in the absence of cholesterol. Human fibroblasts were obtained from skin explants 
from both normal subjects and two patients with homozygous Tangier disease. Cell lines were 
immortalized by transfection with human papillomavirus 16 genes E6 and E7 and a neomycin 
resistance selectable marker. TD-derived cells are deficient in an assay of apoA-I mediated tritiated 
cholesterol efflux. Therefore, SEQ ID NO: 106 is useful in monitoring treatment of, and diagnostic 
assays for Tangier disease. 

In an alternative example, SEQ ID NO: 106 showed differential expression associated with 
immune cells, as determined by microarray analysis. THP-1 is a promonocyte cell line that was isolated 
from the peripheral blood of a 1-year-old male with acute monocytic leukemia. Upon stimulation with 
PMA, THP-1 differentiates into a macrophage-like cell that displays many characteristics of 
peripheral human macrophages. THP-1 cells have been extensively used in the study of signaling in 
human monocytes and the identification of new factors produced by human monocytes. THP-1 cells 
were stimulated in vitro with soluble PMA, a broad activator of the protein kinase C-dependent 
pathways, and ionomycin, a calcium ionophore that permits the entry of calcium in the cell, for 0.5, 1, 
2, 4, and 8 hours. These treated cells were compared to untreated THP-1 cells kept in culture in the 
absence of stimuli. At the 4 and 8 hour timepoints, the expression of SEQ ID NO:106 was increased 
by at least 3.5-fold in THP-1 cells treated with PMA and ionomycin as compared to untreated THP-1 
cells. Jurkat cells, from an acute T cell leukemia cell line that grows actively in the absence of 
external stimuli and has been extensively used to study signaling in human T cells, were treated with 
combinations of graded doses of PMA and ionomycin and collected at a 1 hour time point. In T cells, 
the combination of PMA and ionomycin mimics the type of secondary signaling events elicited during 
optimal B cell activation. The treated cells were compared to untreated Jurkat cells kept in culture in 
the absence of stimuli. The expression of SEQ ID NO: 106 was decreased by at least two-fold in 
Jurkat cells treated with 1 µM PMA and 1, 10, 50 or 200 µg/ml ionomycin. Therefore, SEQ ID 
NO:106 may be useful in monitoring of, and diagnostic assays for immune responses. 

In an alternative example, SEQ ID NO: 102 showed differential expression associated with 
obesity. Human primary subcutaneous preadipocytes were isolated from adipose tissue of a 36-year- 
old healthy female with a body mass index (BMI) of 27.7. The preadipocytes were cultured and 
induced to differentiate into adipocytes by culturing them in the differentiation medium containing the 
active components, PPAR-γ agonist and human insulin. Preadipocytes were treated with human 
insulin and PPAR-γ agonist for three days and subsequently were switched to medium containing 
insulin alone for a total duration of five days, 9 days or 12 days before the cells were collected for 
analysis. Differentiated adipocytes were compared to untreated preadipocytes maintained in culture in 
the absence of inducing agents. The expression of SEQ ID NO: 102 was increased at least two-fold in 
treated human adipocytes from an obese donor when compared to non-treated adipocytes from the 
same donor. Thus, SEQ ID NO: 102 is useful for the diagnosis, prognosis, or treatment of diabetes 
mellitus and other disorders such as obesity. 
XII. Complementary Polynucleotides 

Sequences complementary to the PMMM-encoding sequences, or any parts thereof, are used 
to detect, decrease, or inhibit expression of naturally occurring PMMM. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are 
designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of PMMM. To 
inhibit transcription, a complementary oligonucleotide is designed from the most unique 5' sequence 
and used to prevent promoter binding to the coding sequence. To inhibit translation, a complementary 
oligonucleotide is designed to prevent ribosomal binding to the PMMM-encoding transcript. 
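
As an illustration of how a complementary oligonucleotide relates to the coding sequence, the following minimal sketch returns the reverse complement of a chosen window. The function name, the default length of 20 nucleotides, and the example sequence are hypothetical; this sketch does not reproduce the selection criteria of the OLIGO 4.06 software.

```python
# Watson-Crick pairing for DNA bases.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def complementary_oligo(coding_sequence, start, length=20):
    """Return the oligonucleotide complementary to a window of the coding
    strand, written 5' to 3' (that is, the reverse complement)."""
    window = coding_sequence.upper()[start:start + length]
    if len(window) < length:
        raise ValueError("window runs past the end of the sequence")
    return "".join(COMPLEMENT[base] for base in reversed(window))
```

For example, `complementary_oligo("ATGGCCAAGCTTGGATCCTAG", 0, 15)` yields the 15-mer that pairs with the first fifteen nucleotides of that hypothetical sequence.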
XIII. Expression of PMMM 

Expression and purification of PMMM is achieved using bacterial or virus-based expression 
systems. For expression of PMMM in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express PMMM upon induction with isopropyl beta-D- 
thiogalactopyranoside (IPTG). Expression of PMMM in eukaryotic cells is achieved by infecting 
insect or mammalian cell lines with recombinant Autographa californica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding PMMM by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infection of the latter requires additional genetic modifications to baculovirus (Engelhard, E.K. et al. 
(1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937- 
1945). 

In most expression systems, PMMM is synthesized as a fusion protein with, e.g., glutathione 
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26- 
kilodalton enzyme from Schistosoma japonicum, enables the purification of fusion proteins on 
immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham 
Biosciences). Following purification, the GST moiety can be proteolytically cleaved from PMMM at 
specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 
His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins 
(QIAGEN). Methods for protein expression and purification are discussed in Ausubel et al. (supra, 
ch. 10 and 16). Purified PMMM obtained by these methods can be used directly in the assays shown 
in Examples XVII, XVIII, XIX, and XX, where applicable. 
XIV. Functional Assays 

PMMM function is assessed by expressing the sequences encoding PMMM at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
include PCMV SPORT plasmid (Invitrogen, Carlsbad CA) and PCR3.1 plasmid (Invitrogen), both of 
which contain the cytomegalovirus promoter. 5-10 µg of recombinant vector are transiently 
transfected into a human cell line, for example, an endothelial or hematopoietic cell line, using either 
liposome formulations or electroporation. 1-2 µg of an additional plasmid containing sequences 
encoding a marker protein are co-transfected. Expression of a marker protein provides a means to 
distinguish transfected cells from nontransfected cells and is a reliable predictor of cDNA expression 
from the recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein 
(GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser 
optics-based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to 
evaluate the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the 
uptake of fluorescent molecules that diagnose events preceding or coincident with cell death. These 
events include changes in nuclear DNA content as measured by staining of DNA with propidium 
iodide; changes in cell size and granularity as measured by forward light scatter and 90 degree side 
light scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine 
uptake; alterations in expression of cell surface and intracellular proteins as measured by reactivity 
with specific antibodies; and alterations in plasma membrane composition as measured by the binding 
of fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are 
discussed in Ormerod, M.G. (1994; Flow Cytometry, Oxford, New York NY). 

The influence of PMMM on gene expression can be assessed using highly purified 
populations of cells transfected with sequences encoding PMMM and either CD64 or CD64-GFP. 
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions 
of human immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected 
cells using magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake 
Success NY). mRNA can be purified from the cells using methods well known by those of skill in the 
art. Expression of mRNA encoding PMMM and other genes of interest can be analyzed by northern 
analysis or microarray techniques. 
XV. Production of PMMM Specific Antibodies 

PMMM substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize animals (e.g., rabbits, mice, etc.) and to produce antibodies using standard protocols. 
Alternatively, the PMMM amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art (Ausubel et al., supra, ch. 11). 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431A 
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- 
Aldrich, St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to 
increase immunogenicity (Ausubel et al., supra). Rabbits are immunized with the oligopeptide-KLH 
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti-PMMM 
activity by, for example, binding the peptide or PMMM to a substrate, blocking with 1% BSA, 
reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 
XVI. Purification of Naturally Occurring PMMM Using Specific Antibodies 

Naturally occurring or recombinant PMMM is substantially purified by immunoaffinity 
chromatography using antibodies specific for PMMM. An immunoaffinity column is constructed by 
covalently coupling anti-PMMM antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Biosciences). After the coupling, the resin is blocked and 
washed according to the manufacturer's instructions. 

Media containing PMMM are passed over the immunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of PMMM (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/PMMM binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and PMMM is collected. 
XVII. Identification of Molecules Which Interact with PMMM 

PMMM, or biologically active fragments thereof, are labeled with Bolton-Hunter reagent 
(Bolton, A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539). Candidate molecules previously 
arrayed in the wells of a multi-well plate are incubated with the labeled PMMM, washed, and any 
wells with labeled PMMM complex are assayed. Data obtained using different concentrations of 
PMMM are used to calculate values for the number, affinity, and association of PMMM with the 
candidate molecules. 
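
One conventional way to derive the number of binding sites and the affinity from such concentration-dependent binding data is a Scatchard analysis. The sketch below is offered only as an illustration: it assumes a single class of non-interacting binding sites and paired measurements of free and bound labeled PMMM, and neither that model nor the function name is stated in the procedure above.

```python
def scatchard_fit(free, bound):
    """Estimate Kd and Bmax by linear regression of bound/free against bound.

    For one class of sites, bound/free = Bmax/Kd - bound/Kd, so the slope
    of the Scatchard plot is -1/Kd and the x-intercept is Bmax.
    """
    if len(free) != len(bound) or len(free) < 2:
        raise ValueError("need at least two paired measurements")
    x = list(bound)
    y = [b / f for b, f in zip(bound, free)]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope
    bmax = intercept * kd
    return kd, bmax
```

Kd is returned in the same concentration units as the free ligand values, and Bmax in the units of the bound values. With noisy data a direct nonlinear fit of the binding isotherm is generally preferred, because the Scatchard transformation distorts the error structure.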

Alternatively, molecules interacting with PMMM are analyzed using the yeast two-hybrid 
system as described in Fields, S. and O. Song (1989; Nature 340:245-246), or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

PMMM may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent 
No. 6,057,101). 

XVIII. Demonstration of PMMM Activity 

PMMM activity can be demonstrated using a generic immunoblotting strategy or through a 
variety of specific activity assays, some of which are outlined below. As a general approach, cell lines 
or tissues transformed with a vector containing PMMM coding sequences can be assayed for PMMM 
activity by immunoblotting. Transformed cells are denatured in SDS in the presence of 
β-mercaptoethanol, nucleic acids are removed by ethanol precipitation, and proteins are purified by 
acetone precipitation. Pellets are resuspended in 20 mM Tris buffer at pH 7.5 and incubated with 
Protein G-Sepharose pre-coated with an antibody specific for PMMM. After washing, the Sepharose 
beads are boiled in electrophoresis sample buffer, and the eluted proteins subjected to SDS-PAGE. 
The SDS-PAGE is transferred to a membrane for immunoblotting, and the PMMM activity is assessed 
by visualizing and quantifying bands on the blot using the antibody specific for PMMM as the primary 
antibody and ¹²⁵I-labeled IgG specific for the primary antibody as the secondary antibody. 

PMMM kinase activity is measured by quantifying the phosphorylation of a protein substrate 
by PMMM in the presence of gamma-labeled ³²P-ATP. PMMM is incubated with the protein 
substrate, ³²P-ATP, and an appropriate kinase buffer. The ³²P incorporated into the substrate is 
separated from free ³²P-ATP by electrophoresis and the incorporated ³²P is counted using a 
radioisotope counter. The amount of incorporated ³²P is proportional to the activity of PMMM. A 
determination of the specific amino acid residue phosphorylated is made by phosphoamino acid 
analysis of the hydrolyzed protein. 
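
The proportionality between incorporated radioactivity and kinase activity can be made explicit with a short calculation. The sketch below assumes that a no-enzyme blank is counted alongside the sample and that the specific radioactivity of the ATP stock (counts per minute per pmol) has been determined separately; these inputs and the function name are illustrative assumptions rather than steps recited above.

```python
def kinase_specific_activity(sample_cpm, blank_cpm, atp_cpm_per_pmol,
                             minutes, enzyme_ug):
    """Return pmol of phosphate transferred per minute per microgram of enzyme."""
    if atp_cpm_per_pmol <= 0 or minutes <= 0 or enzyme_ug <= 0:
        raise ValueError("specific radioactivity, time, and enzyme mass must be positive")
    # Subtract the no-enzyme blank; clamp at zero so counting noise
    # cannot produce a negative activity.
    net_cpm = max(sample_cpm - blank_cpm, 0.0)
    pmol_phosphate = net_cpm / atp_cpm_per_pmol
    return pmol_phosphate / minutes / enzyme_ug
```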

PMMM phosphatase activity is measured by the hydrolysis of p-nitrophenyl phosphate 
(PNPP). PMMM is incubated together with PNPP in HEPES buffer, pH 7.5, in the presence of 0.1% 
β-mercaptoethanol at 37 °C for 60 min. The reaction is stopped by the addition of 6 ml of 10 N NaOH 
and the increase in light absorbance at 410 nm resulting from the hydrolysis of PNPP is measured 
using a spectrophotometer. The increase in light absorbance is proportional to the activity of PMMM 
in the assay (Diamond, R.H. et al. (1994) Mol. Cell. Biol. 14:3752-3762). 
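
The absorbance increase can be converted to an amount of product with the Beer-Lambert law. The following sketch assumes a molar absorption coefficient of about 1.8 x 10^4 per M per cm for the p-nitrophenolate ion under alkaline conditions and a 1 cm path length; both values are assumptions that should be replaced by a standard curve measured on the instrument in use, and the function names are illustrative.

```python
def pnpp_product_nmol(delta_a410, volume_ml, path_cm=1.0, epsilon=1.8e4):
    """Return nmol of p-nitrophenol formed, from the absorbance increase.

    epsilon (per M per cm) is an assumed literature-style value, not one
    taken from the assay description; calibrate before relying on it.
    """
    conc_molar = delta_a410 / (epsilon * path_cm)   # Beer-Lambert: c = A / (eps * l)
    return conc_molar * (volume_ml / 1000.0) * 1e9  # mol/L * L -> mol -> nmol


def phosphatase_rate_nmol_per_min(delta_a410, volume_ml, minutes=60.0, **kwargs):
    """Average rate of PNPP hydrolysis over the incubation period."""
    return pnpp_product_nmol(delta_a410, volume_ml, **kwargs) / minutes
```

The volume passed in should be the final volume in which the absorbance is read, that is, after addition of the NaOH stop solution.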

In the alternative, PMMM phosphatase activity is determined by measuring the amount of 
phosphate removed from a phosphorylated protein substrate. Reactions are performed with 2 or 4 nM 
enzyme in a final volume of 30 µl containing 60 mM Tris, pH 7.6, 1 mM EDTA, 1 mM EGTA, 0.1% 
2-mercaptoethanol and 10 µM substrate, ³²P-labeled on serine/threonine or tyrosine, as appropriate. 
Reactions are initiated with substrate and incubated at 30°C for 10-15 min. Reactions are quenched 
with 450 µl of 4% (w/v) activated charcoal in 0.6 M HCl, 90 mM Na4P2O7, and 2 mM NaH2PO4, then 
centrifuged at 12,000 x g for 5 min. Acid-soluble ³²Pi is quantified by liquid scintillation counting 
(Sinclair, C. et al. (1999) J. Biol. Chem. 274:23666-23672). 

PMMM protease activity is measured by the hydrolysis of appropriate synthetic peptide 
substrates conjugated with various chromogenic molecules in which the degree of hydrolysis is 
quantified by spectrophotometric (or fluorometric) absorption of the released chromophore (Beynon, 
R.J. and J.S. Bond (1994) Proteolytic Enzymes: A Practical Approach, Oxford University Press, New 
York, NY, pp. 25-55). Peptide substrates are designed according to the category of protease activity 
as endopeptidase (serine, cysteine, aspartic proteases, or metalloproteases), aminopeptidase (leucine 
aminopeptidase), or carboxypeptidase (carboxypeptidases A and B, procollagen C-proteinase). 
Commonly used chromogens are 2-naphthylamine, 4-nitroaniline, and furylacrylic acid. Assays are 
performed at ambient temperature and contain an aliquot of the enzyme and the appropriate substrate 
in a suitable buffer. Reactions are carried out in an optical cuvette, and the increase/decrease in 
absorbance of the chromogen released during hydrolysis of the peptide substrate is measured. The 
change in absorbance is proportional to the enzyme activity in the assay. 

In the alternative, an assay for PMMM protease activity takes advantage of fluorescence 
resonance energy transfer (FRET) that occurs when one donor and one acceptor fluorophore with an 
appropriate spectral overlap are in close proximity. A flexible peptide linker containing a cleavage 
site specific for PMMM is fused between a red-shifted variant (RSGFP4) and a blue variant (BFP5) of 
Green Fluorescent Protein. This fusion protein has spectral properties that suggest energy transfer is 
occurring from BFP5 to RSGFP4. When the fusion protein is incubated with PMMM, the substrate is 
cleaved, and the two fluorescent proteins dissociate. This is accompanied by a marked decrease in 
energy transfer which is quantified by comparing the emission spectra before and after the addition of 
PMMM (Mitra, R.D. et al (1996) Gene 173:13-17). This assay can also be performed in living cells. 
In this case the fluorescent substrate protein is expressed constitutively in cells and PMMM is 
introduced on an inducible vector so that FRET can be monitored in the presence and absence of 
PMMM (Sagot, I. et al (1999) FEBS Letters 447:53-57). 
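
The decrease in energy transfer can be summarized as a change in the ratio of acceptor emission to donor emission, both read from the spectra recorded under donor excitation. The sketch below assumes that the peak intensities of each spectrum have already been extracted; the function names and the choice of a simple intensity ratio are illustrative assumptions and are not taken from the cited references.

```python
def fret_ratio(acceptor_emission, donor_emission):
    """Acceptor/donor emission ratio measured under donor excitation."""
    if donor_emission <= 0:
        raise ValueError("donor emission must be positive")
    return acceptor_emission / donor_emission


def fractional_fret_loss(before, after):
    """Fractional decrease in the emission ratio after protease addition.

    `before` and `after` are (acceptor_emission, donor_emission) pairs.
    A value near 0 indicates no cleavage; larger values indicate loss of
    energy transfer as the two fluorescent proteins separate.
    """
    ratio_before = fret_ratio(*before)
    ratio_after = fret_ratio(*after)
    return (ratio_before - ratio_after) / ratio_before
```

A ratio is used instead of the acceptor intensity alone because it is less sensitive to differences in substrate concentration between samples.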

An assay for ubiquitin hydrolase activity measures the hydrolysis of a ubiquitin precursor. 
The assay is performed at ambient temperature and contains an aliquot of PMMM and the appropriate 
substrate in a suitable buffer. Chemically synthesized human ubiquitin-valine may be used as 
substrate. Cleavage of the C-terminal valine residue from the substrate is monitored by capillary 
electrophoresis (Franklin, K. et al. (1997) Anal. Biochem. 247:305-309). 

PMMM protease inhibitor activity for alpha 2-HS-glycoprotein (AHSG) can be measured as a 
decrease in osteogenic activity in dexamethasone-treated rat bone marrow cell cultures (dex-RBMC). 
Assays are carried out in 96-well culture plates containing minimal essential medium supplemented 
with 15% fetal bovine serum, ascorbic acid (50 mg/ml), antibiotics (100 mg/ml penicillin G, 50 mg/ml 
gentamicin, 0.3 mg/ml fungizone), 10 mM β-glycerophosphate, dexamethasone (10⁻⁸ M) and various 
concentrations of PMMM for 12-14 days. Mineralized tissue formation in the cultures is quantified 
by measuring the absorbance at 525 nm using a 96-well plate reader (Binkert, C. et al. (1999) J. Biol. 
Chem. 274:28514-28520). 

PMMM protease inhibitor activity for inter-alpha-trypsin inhibitor (ITI) can be measured by a 
continuous spectrophotometric rate determination of trypsin activity. The assay is performed at 
ambient temperature in a quartz cuvette in pH 7.6 assay buffer containing 63 mM sodium phosphate, 
0.23 mM Nα-benzoyl-L-arginine ethyl ester, 0.06 mM hydrochloric acid, 100 units trypsin, and 
various concentrations of PMMM. Immediately after mixing by inversion, the increase in A253 is 
recorded for approximately 5 minutes and the enzyme activity is calculated (Bergmeyer, H.U. et al. 
(1974) Meth. Enzym. Anal. 1:515-516). 
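
The rate calculation for this continuous assay can be sketched as a linear fit of A253 against time, followed by comparison with a control reaction lacking PMMM. The inclusion of such a control, the use of a least-squares slope, and the function names are assumptions made for illustration.

```python
def slope_per_min(times_min, absorbance):
    """Least-squares slope of absorbance versus time (absorbance units per minute)."""
    n = len(times_min)
    if n != len(absorbance) or n < 2:
        raise ValueError("need at least two paired readings")
    mean_t = sum(times_min) / n
    mean_a = sum(absorbance) / n
    stt = sum((t - mean_t) ** 2 for t in times_min)
    sta = sum((t - mean_t) * (a - mean_a) for t, a in zip(times_min, absorbance))
    return sta / stt


def percent_inhibition(rate_control, rate_with_inhibitor):
    """Percent reduction in trypsin rate relative to the no-PMMM control."""
    if rate_control <= 0:
        raise ValueError("control rate must be positive")
    return 100.0 * (1.0 - rate_with_inhibitor / rate_control)
```

Only the initial, linear portion of each trace should be passed to `slope_per_min`, since the rate falls as substrate is consumed.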

PMMM isomerase activity such as peptidyl prolyl cis/trans isomerase activity can be assayed 
by an enzyme assay described by Rahfeld, J.U., et al. (1994; FEBS Lett. 352:180-184). The assay is 
performed at 10°C in 35 mM HEPES buffer, pH 7.8, containing chymotrypsin (0.5 mg/ml) and 
PMMM at a variety of concentrations. Under these assay conditions, the substrate, Suc-Ala-Xaa-Pro- 
Phe-4-NA, is in equilibrium with respect to the prolyl bond, with 80-95% in trans and 5-20% in cis 
conformation. An aliquot (2 ml) of the substrate dissolved in dimethyl sulfoxide (10 mg/ml) is added 
to the reaction mixture described above. Only the cis isomer of the substrate is a substrate for 
cleavage by chymotrypsin. Thus, as the substrate is isomerized by PMMM, the product is cleaved by 
chymotrypsin to produce 4-nitroanilide, which is detected by its absorbance at 390 nm. 4-nitroanilide 
appears in a time-dependent and a PMMM concentration-dependent manner. 

PMMM galactosyltransferase activity can be determined by measuring the transfer of 
radiolabeled galactose from UDP-galactose to a GlcNAc-terminated oligosaccharide chain (Kolbinger, 
F. et al. (1998) J. Biol. Chem. 273:58-65). The sample is incubated with 14 µl of assay stock solution 
(180 mM sodium cacodylate, pH 6.5, 1 mg/ml bovine serum albumin, 0.26 mM UDP-galactose, 2 µl 
of UDP-[³H]galactose), 1 µl of MnCl2 (500 mM), and 2.5 µl of GlcNAcβO-(CH2)8-CO2Me (37 mg/ml 
in dimethyl sulfoxide) for 60 minutes at 37 °C. The reaction is quenched by the addition of 1 ml of 
water and loaded on a C18 Sep-Pak cartridge (Waters), and the column is washed twice with 5 ml of 
water to remove unreacted UDP-[³H]galactose. The [³H]galactosylated GlcNAcβO-(CH2)8-CO2Me 
remains bound to the column during the water washes and is eluted with 5 ml of methanol. 
Radioactivity in the eluted material is measured by liquid scintillation counting and is proportional to 
galactosyltransferase activity in the starting sample. 

PMMM induction by heat or toxins may be demonstrated using primary cultures of human 
fibroblasts or human cell lines such as CCL-13, HEK293, or HEP G2 (ATCC). To heat induce 
PMMM expression, aliquots of cells are incubated at 42°C for 15, 30, or 60 minutes. Control aliquots 
are incubated at 37°C for the same time periods. To induce PMMM expression by toxins, aliquots of 
cells are treated with 100 μM arsenite or 20 mM azetidine-2-carboxylic acid for 0, 3, 6, or 12 hours. 
After exposure to heat, arsenite, or the amino acid analogue, samples of the treated cells are harvested 
and cell lysates prepared for analysis by western blot. Cells are lysed in lysis buffer containing 1% 
Nonidet P-40, 0.15 M NaCl, 50 mM Tris-HCl, 5 mM EDTA, 2 mM N-ethylmaleimide, 2 mM 
phenylmethylsulfonyl fluoride, 1 mg/ml leupeptin, and 1 mg/ml pepstatin. Twenty micrograms of the 
cell lysate is separated on an 8% SDS-PAGE gel and transferred to a membrane. After blocking with 
5% nonfat dry milk/phosphate-buffered saline for 1 h, the membrane is incubated overnight at 4°C or 
at room temperature for 2-4 hours with an appropriate dilution of anti-PMMM serum in 2% nonfat dry 
milk/phosphate-buffered saline. The membrane is then washed and incubated with a 1:1000 dilution 
of horseradish peroxidase-conjugated goat anti-rabbit IgG in 2% dry milk/phosphate-buffered saline. 
After washing with 0.1% Tween 20 in phosphate-buffered saline, the PMMM protein is detected and 
compared to controls using chemiluminescence. 

PMMM lysyl hydroxylase activity is determined by measuring the production of 
hydroxy[14C]lysine from [14C]lysine. Radiolabeled protocollagen is incubated with PMMM in buffer 
containing ascorbic acid, iron sulfate, dithiothreitol, bovine serum albumin, and catalase. Following a 
30 minute incubation, the reaction is stopped by the addition of acetone, and centrifuged. The 
sedimented material is dried, and the hydroxy[14C]lysine is converted to [14C]formaldehyde by 
oxidation with periodate, and then extracted into toluene. The amount of 14C extracted into toluene is 
quantified by scintillation counting, and is proportional to the activity of PMMM in the sample 
(Kivirikko, K., and R. Myllyla (1982) Methods Enzymol. 82:245-304). 

XIX. Identification of PMMM Substrates 

Phage display libraries can be used to identify optimal substrate sequences for PMMM. A 
random hexamer followed by a linker and a known antibody epitope is cloned as an N-terminal 
extension of gene III in a filamentous phage library. Gene III codes for a coat protein, and the epitope 
will be displayed on the surface of each phage particle. The library is incubated with PMMM under 
proteolytic conditions so that the epitope will be removed if the hexamer codes for a PMMM cleavage 
site. An antibody that recognizes the epitope is added along with immobilized protein A. Uncleaved 
phage, which still bear the epitope, are removed by centrifugation. Phage in the supernatant are then 
amplified and undergo several more rounds of screening. Individual phage clones are then isolated 
and sequenced. Reaction kinetics for these peptide substrates can be studied using an assay in 
Example XVIII, and an optimal cleavage sequence can be derived (Ke, S.H. et al. (1997) J. Biol. 
Chem. 272:16603-16609). 
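
Deriving a cleavage consensus from the sequenced clones can be sketched as a per-position residue tally. The code below is an illustrative assumption rather than the procedure of the cited reference: it takes equal-length hexamer sequences from the selected phage, counts residues at each position, and reports the most frequent residue per position. The function names are hypothetical.

```python
from collections import Counter


def position_counts(peptides):
    """Return one Counter of residues per position for equal-length peptides."""
    if not peptides:
        raise ValueError("no peptides supplied")
    length = len(peptides[0])
    if any(len(p) != length for p in peptides):
        raise ValueError("all peptides must have the same length")
    return [Counter(p[i] for p in peptides) for i in range(length)]


def consensus(peptides):
    """Most frequent residue at each position.

    Ties go to the residue seen first among the input peptides.
    """
    return "".join(c.most_common(1)[0][0] for c in position_counts(peptides))
```

A simple majority consensus ignores how strongly each position is enriched; the kinetic comparison of individual peptides described above remains the basis for choosing an optimal sequence.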

To screen for in vivo PMMM substrates, this method can be expanded to screen a cDNA 
expression library displayed on the surface of phage particles (T7SELECT 10-3 Phage display vector, 
Novagen, Madison, WI) or yeast cells (pYD1 yeast display vector kit, Invitrogen, Carlsbad, CA). In 
this case, entire cDNAs are fused between Gene III and the appropriate epitope. 

XX. Identification of PMMM Inhibitors 

Compounds to be tested are arrayed in the wells of a multi-well plate in varying 
concentrations along with an appropriate buffer and substrate, as described in the assays in Example 
XVIII. PMMM activity is measured for each well and the ability of each compound to inhibit PMMM 
activity can be determined, as well as the dose-response kinetics. This assay could also be used to 
identify molecules which enhance PMMM activity. 
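
The dose-response analysis can be sketched as an IC50 estimate from per-well activities. The example assumes activities are already expressed as a percentage of an uninhibited control and that the 50% point is found by linear interpolation on log10 concentration between the two bracketing wells; both the function name and the interpolation approach are assumptions for illustration, and a fitted sigmoidal model could be substituted.

```python
import math


def ic50_by_interpolation(concentrations, percent_activities):
    """Interpolate the concentration giving 50% activity on a log10 scale.

    Concentrations must be positive. Returns None when no adjacent pair of
    wells brackets 50% activity.
    """
    if len(concentrations) != len(percent_activities):
        raise ValueError("concentrations and activities must be paired")
    if any(c <= 0 for c in concentrations):
        raise ValueError("concentrations must be positive")
    pairs = sorted(zip(concentrations, percent_activities))
    for (c_lo, a_lo), (c_hi, a_hi) in zip(pairs, pairs[1:]):
        if a_lo >= 50.0 >= a_hi and a_lo != a_hi:
            fraction = (a_lo - 50.0) / (a_lo - a_hi)
            log_ic50 = math.log10(c_lo) + fraction * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None
```

A compound that enhances PMMM activity would show activities above 100% and no crossing of the 50% level, in which case the function returns None.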

In the alternative, phage display libraries can be used to screen for peptide PMMM inhibitors. 
Candidates are found among peptides which bind tightly to a protease. In this case, multi-well plate 
wells are coated with PMMM and incubated with a random peptide phage display library or a cyclic 
peptide library (Koivunen, E. et al. (1999) Nature Biotech 17:768-774). Unbound phage are washed 
away and selected phage amplified and rescreened for several more rounds. Candidates are tested for 
PMMM inhibitory activity using an assay described in Example XVIII. 

Various modifications and variations of the described compositions, methods, and systems of 
the invention will be apparent to those skilled in the art without departing from the scope and spirit of 
the invention. It will be appreciated that the invention provides novel and useful proteins, and their 
encoding polynucleotides, which can be used in the drug discovery process, as well as methods for 
using these compositions for the detection, diagnosis, and treatment of diseases and conditions. 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Nor should the description of such embodiments be considered exhaustive or limit the invention to the 
precise forms disclosed. Furthermore, elements from one embodiment can be readily recombined with 
elements from one or more other embodiments. Such combinations can form a number of 
embodiments within the scope of the invention. It is intended that the scope of the invention be 
defined by the following claims and their equivalents. 

Incyte Project ID   Incyte Polypeptide ID   Incyte Polynucleotide ID
7509341             7509341CD1              7509341CB1
7509367             7509367CD1              7509367CB1
7500455             7500455CD1              7500455CB1
7510401             7510401CD1              7510401CB1
7504702             7504702CD1              7504702CB1
7509113             7509113CD1              7509113CB1
7509140             7509140CD1              7509140CB1
7509223             7509223CD1              7509223CB1
7509272             7509272CD1              7509272CB1
7509327             7509327CD1              7509327CB1
7504677             7504677CD1              7504677CB1
7504534             7504534CD1              7504534CB1
7507771             7507771CD1              7507771CB1
7504732             7504732CD1              7504732CB1
950917              950917CD1               950917CB1
7459720             7459720CD1              7459720CB1
7503300             7503300CD1              7503300CB1
7503334             7503334CD1              7503334CB1
7503341             7503341CD1              7503341CB1
7509936             7509936CD1              7509936CB1
7509986             7509986CD1              7509986CB1
7510010             7510010CD1              7510010CB1
7510056             7510056CD1              7510056CB1
7510398             7510398CD1              7510398CB1
7510498             7510498CD1              7510498CB1
7510044             7510044CD1              7510044CB1
7504509             7504509CD1              7504509CB1

[Polypeptide SEQ ID NO: and Polynucleotide SEQ ID NO: columns: values illegible in source]

Incyte Full Length Clones [listed in source order; row assignment not recoverable]
90127811CA2
90127934CA2
5299346CA2, 90043663CA2, 90043779CA2, 95102091CA2
90040245CA2
1241883CA2, 90116231CA2, 90116378CA2, 90177975CA2, 90178051CA2, 90178095CA2, 90178111CA2, 90178119CA2
90134942CA2
90153388CA2
90135090CA2
90134126CA2
90134234CA2
95137671CA2
4921532CA2, 6824346CA2, 90005372CA2
90009616CA2
90134242CA2
2678676CA2, 2679358CA2, 3608542CA2, 4331458CA2, 980724CA2

What is claimed is: 

1. An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO:1-58, 

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:3, SEQ ID NO:6-7, SEQ ID NO:10-12, SEQ ID NO:15, SEQ ID NO:17-18, SEQ 
ID NO:21, SEQ ID NO:23-25, SEQ ID NO:27, SEQ ID NO:33, SEQ ID NO:35, SEQ 
ID NO:37, SEQ ID NO:41-42, SEQ ID NO:45, SEQ ID NO:50, and SEQ ID NO:57, 

c) a polypeptide comprising a naturally occurring amino acid sequence at least 95% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:2, SEQ ID NO:5, SEQ ID NO:19, and SEQ ID NO:58, 

d) a polypeptide comprising a naturally occurring amino acid sequence at least 93% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:8 and SEQ ID NO:32, 

e) a polypeptide comprising a naturally occurring amino acid sequence at least 99% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:13, SEQ ID NO:26, and SEQ ID NO:36, 

f) a polypeptide comprising a naturally occurring amino acid sequence at least 91% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:16, SEQ ID NO:47, and SEQ ID NO:56, 

g) a polypeptide comprising a naturally occurring amino acid sequence at least 92% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:49 and SEQ ID NO:56, 

h) a polypeptide comprising a naturally occurring amino acid sequence at least 96% 
identical to the amino acid sequence of SEQ ID NO:44, 

i) a polypeptide comprising a naturally occurring amino acid sequence at least 97% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:40 and SEQ ID NO:48, 

j) a polypeptide comprising a naturally occurring amino acid sequence at least 98% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:54-55, 

k) a polypeptide consisting essentially of a naturally occurring amino acid sequence at 
least 90% identical to an amino acid sequence selected from the group consisting of 
SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:9, SEQ ID NO:14, SEQ ID NO:20, SEQ 
ID NO:22, SEQ ID NO:28-31, SEQ ID NO:34, SEQ ID NO:38-39, SEQ ID NO:43, 
SEQ ID NO:46, and SEQ ID NO:51-53, 

l) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO:1-58, and 

m) an immunogenic fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO:1-58. 

2. An isolated polypeptide of claim 1 comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-58. 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

5. An isolated polynucleotide of claim 4 comprising a polynucleotide sequence selected from 
the group consisting of SEQ ID NO:59-116. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method of producing a polypeptide of claim 1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein 
said cell is transformed with a recombinant polynucleotide, and said recombinant 
polynucleotide comprises a promoter sequence operably linked to a polynucleotide 
encoding the polypeptide of claim 1, and 

b) recovering the polypeptide so expressed. 

10. A method of claim 9, wherein the polypeptide comprises an amino acid sequence selected 
from the group consisting of SEQ ID NO:1-58. 

11. An isolated antibody which specifically binds to a polypeptide of claim 1. 

12. An isolated polynucleotide selected from the group consisting of: 

a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:59-116, 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:61-66, SEQ ID NO:68-70, SEQ ID NO:73-75, SEQ ID NO:79, SEQ ID 
NO:81-82, SEQ ID NO:84, SEQ ID NO:88, SEQ ID NO:91-93, SEQ ID NO:95, SEQ 
ID NO:98, SEQ ID NO:111, and SEQ ID NO:115-116, 

c) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
98% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:59 and SEQ ID NO:101-102, 

d) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
95% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:60 and SEQ ID NO:103, 

e) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
99% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:67, SEQ ID NO:72, SEQ ID NO:96, and SEQ ID NO:99-100, 

f) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
97% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:71, SEQ ID NO:87, SEQ ID NO:97, and SEQ ID NO:105, 

g) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
91% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:104 and SEQ ID NO:113, 

h) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
92% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:109-110, 

i) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
93% identical to the polynucleotide sequence of SEQ ID NO:108, 

j) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
94% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:107 and SEQ ID NO:114, 

k) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
96% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:106, SEQ ID NO:86, SEQ ID NO:94, and SEQ ID NO:112, 

l) a polynucleotide consisting essentially of a naturally occurring polynucleotide 
sequence at least 90% identical to a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:76-78, 

m) a polynucleotide complementary to a polynucleotide of a), 

n) a polynucleotide complementary to a polynucleotide of b), 

o) a polynucleotide complementary to a polynucleotide of c), 

p) a polynucleotide complementary to a polynucleotide of d), 

q) a polynucleotide complementary to a polynucleotide of e), 

r) a polynucleotide complementary to a polynucleotide of f), 

s) a polynucleotide complementary to a polynucleotide of g), 

t) a polynucleotide complementary to a polynucleotide of h), 

u) a polynucleotide complementary to a polynucleotide of i), 

v) a polynucleotide complementary to a polynucleotide of j), 

w) a polynucleotide complementary to a polynucleotide of k), 

x) a polynucleotide complementary to a polynucleotide of l), and 

y) an RNA equivalent of a)-x). 

13. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 12. 

14. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, 
and which probe specifically hybridizes to said target polynucleotide, under 
conditions whereby a hybridization complex is formed between said probe and said 
target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

15. A method of claim 14, wherein the probe comprises at least 60 contiguous nucleotides. 

16. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain 
reaction amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 

17. A composition comprising a polypeptide of claim 1 and a pharmaceutically acceptable 
excipient. 

18. A composition of claim 17, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-58. 

19. A method for treating a disease or condition associated with decreased expression of 
functional PMMM, comprising administering to a patient in need of such treatment the composition of 
claim 17. 

20. A method of screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

21. A composition comprising an agonist compound identified by a method of claim 20 and a 
pharmaceutically acceptable excipient. 

22. A method for treating a disease or condition associated with decreased expression of 
functional PMMM, comprising administering to a patient in need of such treatment a composition of 
claim 21. 

23. A method of screening a compound for effectiveness as an antagonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

24. A composition comprising an antagonist compound identified by a method of claim 23 
and a pharmaceutically acceptable excipient. 

25. A method for treating a disease or condition associated with overexpression of functional 
PMMM, comprising administering to a patient in need of such treatment a composition of claim 24. 

26. A method of screening for a compound that specifically binds to the polypeptide of claim 
1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby 
identifying a compound that specifically binds to the polypeptide of claim 1. 

27. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under 
conditions permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test 
compound, and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test 
compound with the activity of the polypeptide of claim 1 in the absence of the test 
compound, wherein a change in the activity of the polypeptide of claim 1 in the 
presence of the test compound is indicative of a compound that modulates the activity 
of the polypeptide of claim 1. 

28. A method of screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under 
conditions suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of varying 
amounts of the compound and in the absence of the compound. 

29. A method of assessing toxicity of a test compound, the method comprising: 

a) treating a biological sample containing nucleic acids with the test compound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising 
at least 20 contiguous nucleotides of a polynucleotide of claim 12 under conditions 
whereby a specific hybridization complex is formed between said probe and a target 
polynucleotide in the biological sample, said target polynucleotide comprising a 
polynucleotide sequence of a polynucleotide of claim 12 or fragment thereof, 

c) quantifying the amount of hybridization complex, and 

d) comparing the amount of hybridization complex in the treated biological sample with 
the amount of hybridization complex in an untreated biological sample, wherein a 
difference in the amount of hybridization complex in the treated biological sample is 
indicative of toxicity of the test compound. 

30. A method for a diagnostic test for a condition or disease associated with the expression of 
PMMM in a biological sample, the method comprising: 

a) combining the biological sample with an antibody of claim 11, under conditions 
suitable for the antibody to bind the polypeptide and form an antibody:polypeptide 
complex, and 

b) detecting the complex, wherein the presence of the complex correlates with the 
presence of the polypeptide in the biological sample. 

31. The antibody of claim 11, wherein the antibody is: 

a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) a F(ab')2 fragment, or 

e) a humanized antibody. 

32. A composition comprising an antibody of claim 11 and an acceptable excipient. 






wo 03/063688 



PCT/US03/02500 



33. A method of diagnosing a condition or disease associated with the expression of PMMM 
in a subject, comprising administering to said subject an effective amount of the composition of claim 
32. 

34. A composition of claim 32, further comprising a label. 

35. A method of diagnosing a condition or disease associated with the expression of PMMM 
in a subject, comprising administering to said subject an effective amount of the composition of claim 
34. 

36. A method of preparing a polyclonal antibody with the specificity of the antibody of claim 
11, the method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-58, or an immunogenic fragment 
thereof, under conditions to elicit an antibody response, 

b) isolating antibodies from the animal, and 

c) screening the isolated antibodies with the polypeptide, thereby identifying a 
polyclonal antibody which specifically binds to a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO:1-58. 

37. A polyclonal antibody produced by a method of claim 36. 

38. A composition comprising the polyclonal antibody of claim 37 and a suitable carrier. 

39. A method of making a monoclonal antibody with the specificity of the antibody of claim 
11, the method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 
selected from the group consisting of SEQ ID NO:1-58, or an immunogenic fragment 
thereof, under conditions to elicit an antibody response, 

b) isolating antibody producing cells from the animal, 

c) fusing the antibody producing cells with immortalized cells to form monoclonal 
antibody-producing hybridoma cells, 

d) culturing the hybridoma cells, and 

e) isolating from the culture monoclonal antibody which specifically binds to a 
polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO:1-58. 

40. A monoclonal antibody produced by a method of claim 39. 

41. A composition comprising the monoclonal antibody of claim 40 and a suitable carrier. 

42. The antibody of claim 11, wherein the antibody is produced by screening a Fab 
expression library. 

43. The antibody of claim 11, wherein the antibody is produced by screening a recombinant 
immunoglobulin library. 

44. A method of detecting a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-58 in a sample, the method comprising: 

a) incubating the antibody of claim 11 with the sample under conditions to allow 
specific binding of the antibody and the polypeptide, and 

b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO:1-58 in the sample. 

45. A method of purifying a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-58 from a sample, the method comprising: 

a) incubating the antibody of claim 11 with the sample under conditions to allow 
specific binding of the antibody and the polypeptide, and 

b) separating the antibody from the sample and obtaining the purified polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-58. 

46. A microarray wherein at least one element of the microarray is a polynucleotide of claim 
13. 

47. A method of generating an expression profile of a sample which contains polynucleotides, 
the method comprising: 

a) labeling the polynucleotides of the sample, 

b) contacting the elements of the microarray of claim 46 with the labeled 
polynucleotides of the sample under conditions suitable for the formation of a 
hybridization complex, and 

c) quantifying the expression of the polynucleotides in the sample. 

48. An array comprising different nucleotide molecules affixed in distinct physical locations 
on a solid substrate, wherein at least one of said nucleotide molecules comprises a first 
oligonucleotide or polynucleotide sequence specifically hybridizable with at least 30 contiguous 
nucleotides of a target polynucleotide, and wherein said target polynucleotide is a polynucleotide of 
claim 12. 

49. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 30 contiguous nucleotides of said target polynucleotide. 

50. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 60 contiguous nucleotides of said target polynucleotide. 

51. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to said target polynucleotide. 

52. An array of claim 48, which is a microarray. 

53. An array of claim 48, further comprising said target polynucleotide hybridized to a 
nucleotide molecule comprising said first oligonucleotide or polynucleotide sequence. 

54. An array of claim 48, wherein a linker joins at least one of said nucleotide molecules to 
said solid substrate. 

55. An array of claim 48, wherein each distinct physical location on the substrate contains 
multiple nucleotide molecules, and the multiple nucleotide molecules at any single distinct physical 
location have the same sequence, and each distinct physical location on the substrate contains 
nucleotide molecules having a sequence which differs from the sequence of nucleotide molecules at 
another distinct physical location on the substrate. 

56. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:1. 

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2. 

58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:3. 

59. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:4. 

60. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5. 

61. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:6. 

62. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7. 

63. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:8. 

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:9. 

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:10. 

66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:11. 

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:12. 

68. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:13. 

69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:14. 

70. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:15. 

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:16. 

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 17. 

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:18. 

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:19. 

75. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:20. 

76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:21. 

77. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

78. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:23. 

79. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:24. 

80. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:25. 

81. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:26. 

82. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:27. 

83. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:28. 

84. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:29. 

85. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:30. 

86. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:31. 

87. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:32. 

88. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:33. 

89. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 34. 

90. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:35. 

91. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:36.

92. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:37.

93. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:38. 

94. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:39.

95. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:40. 

96. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:41. 

97. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:42. 

98. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:43.

99. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:44. 

100. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:45. 

101. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:46. 

102. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:47. 

103. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:48. 

104. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:49. 

105. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:50. 

106. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:51.

107. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:52.

108. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:53. 

109. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:54.

110. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:55.

111. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:56.

112. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:57. 

113. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:58. 

114. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:59.

115. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:60.

116. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:61.

117. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:62.

118. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:63.

119. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:64.

120. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:65.

121. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:66.

122. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:67.

123. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:68.

124. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:69.

125. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:70.

126. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:71.

127. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:72.

128. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:73.

129. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:74.

130. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:75.

131. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:76.

132. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:77.

133. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:78.

134. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:79.

135. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:80.

136. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:81.

137. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:82.

138. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:83.

139. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:84.

140. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:85.

141. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:86.

142. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:87.

143. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:88.

144. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:89.

145. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:90.

146. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:91.


WO 03/063688 PCT/US03/02500

147. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:92.

148. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:93.

149. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:94.

150. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:95.

151. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:96.

152. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:97.

153. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:98.

154. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:99.

155. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:100.

156. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:101.

157. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:102.

158. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:103.

159. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:104.

160. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:105.

161. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:106.

162. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:107.

163. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:108.

164. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:109.

165. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:110.

166. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:111.

167. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:112.

168. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:113.

169. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:114.

170. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:115.

171. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:116.

155 160 165

Gly Tyr Val Gly Leu Val Asn Gln Ala Met Thr Cys Tyr Leu Asn
170 175 180

Ser Leu Leu Gln Thr Leu Phe Met Thr Pro Glu Phe Arg Asn Ala
185 190 195

Leu Tyr Lys Trp Glu Phe Glu Glu Ser Glu Glu Asp Pro Val Thr
200 205 210

Ser Ile Pro Tyr Gln Leu Gln Arg Leu Phe Val Leu Leu Gln Thr
215 220 225

Ser Lys Lys Arg Ala Ile Glu Thr Thr Asp Val Thr Arg Ser Phe
230 235 240

Gly Trp Asp Ser Ser Glu Ala Trp Gln Gln His Asp Val Gln Glu
245 250 255

Leu Cys Arg Val Met Phe Asp Ala Leu Glu Gln Lys Trp Lys Gln
260 265 270

Thr Glu Gln Ala Asp Leu Ile Asn Glu Leu Tyr Gln Gly Lys Leu
275 280 285

Lys Asp Tyr Val Arg Cys Leu Glu Cys Gly Tyr Glu Gly Trp Arg
290 295 300

Ile Asp Thr Tyr Leu Asp Ile Pro Leu Val Ile Arg Pro Tyr Gly
305 310 315

Ser Ser Gln Ala Phe Ala Ser Val Glu Glu Ala Leu His Ala Phe
320 325 330

Ile Gln Pro Glu Ile Leu Asp Gly Pro Asn Gln Tyr Phe Cys Glu
335 340 345

Arg Cys Lys Lys Lys Cys Asp Ala Arg Lys Gly Leu Arg Phe Leu
350 355 360

His Phe Pro Tyr Leu Leu Thr Leu Gln Leu Lys Arg Phe Asp Phe
365 370 375

Asp Tyr Thr Thr Met His Arg Ile Lys Leu Asn Asp Arg Met Thr
380 385 390

Phe Pro Glu Glu Leu Asp Met Ser Thr Phe Ile Asp Val Glu Asp
395 400 405

Glu Lys Ser Pro Gln Thr Glu Ser Cys Thr Asp Ser Gly Ala Glu
410 415 420

Asn Glu Gly Ser Cys His Ser Asp Gln Met Ser Asn Asp Phe Ser
425 430 435

Asn Asp Asp Gly Val Asp Glu Gly Ile Cys Leu Glu Thr Asn Ser
440 445 450

Gly Thr Glu Lys Ile Ser Lys Ser Gly Leu Glu Lys Asn Ser Leu
455 460 465

Ile Tyr Glu Leu Phe Ser Val Met Val His Ser Gly Ser Ala Ala
470 475 480

Gly Gly His Tyr Tyr Ala Cys Ile Lys Ser Phe Ser Asp Glu Gln
485 490 495

Trp Tyr Ser Phe Asn Asp Gln His Val Ser Arg Ile Thr Gln Glu
500 505 510

Asp Ile Lys Lys Thr His Gly Gly Ser Ser Gly Ser Arg Gly Tyr
515 520 525

Tyr Ser Ser Ala Phe Ala Ser Ser Thr Asn Ala Tyr Met Leu Ile
530 535 540

Tyr Arg Leu Lys Asp Pro Ala Arg Asn Ala Lys Phe Leu Glu Val
545 550 555

Asp Glu Tyr Pro Glu His Ile Lys Asn Leu Val Gln Lys Glu Arg
560 565 570

Glu Leu Glu Glu Gln Glu Lys Arg Gln Arg Glu Ile Glu Arg Asn
575 580 585

Thr Cys Lys Ile Lys Leu Phe Cys Leu His Pro Thr Lys Gln Val
590 595 600

Met Met Glu Asn Lys Leu Glu Val His Lys Asp Lys Thr Leu Lys
605 610 615

Glu Ala Val Glu Met Ala Tyr Lys Met Met Asp Leu Glu Glu Val
620 625 630

Ile Pro Leu Asp Cys Cys Arg Leu Val Lys Tyr Asp Glu Phe His
635 640 645

Asp Tyr Leu Glu Arg Ser Tyr Glu Gly Glu Glu Asp Thr Pro Met
650 655 660

Gly Leu Leu Leu Gly Gly Val Lys Ser Thr Tyr Met Phe Asp Leu
665 670 675

Leu Leu Glu Thr Arg Lys Pro Asp Gln Val Phe Gln Ser Tyr Lys
680 685 690

Pro Gly Glu Val Met Val Lys Val His Val Val Asp Leu Lys Ala
695 700 705

Glu Ser Val Ala Ala Pro Ile Thr Val Arg Ala Tyr Leu Asn Gln
710 715 720

Thr Val Thr Glu Phe Lys Gln Leu Ile Ser Lys Ala Ile His Leu
725 730 735

Pro Ala Glu Thr Met Arg Ile Val Leu Glu Arg Cys Tyr Asn Asp
740 745 750

Leu Arg Leu Leu Ser Val Ser Ser Lys Thr Leu Lys Ala Glu Gly
755 760 765

Phe Phe Arg Ser Asn Lys Val Phe Val Glu Ser Ser Glu Thr Leu
770 775 780

Asp Tyr Gln Met Ala Phe Ala Asp Ser His Leu Trp Lys Leu Leu
785 790 795

Asp Arg His Ala Asn Thr Ile Arg Leu Phe Val Leu Leu Pro Glu
800 805 810

Gln Ser Pro Val Ser Tyr Ser Lys Arg Thr Ala Tyr Gln Lys Ala
815 820 825

Gly Gly Asp Ser Gly Asn Val Asp Asp Asp Cys Glu Arg Val Lys
830 835 840

Gly Pro Val Gly Ser Leu Lys Ser Val Glu Ala Ile Leu Glu Glu
845 850 855

Ser Thr Glu Lys Leu Lys Ser Leu Ser Leu Gln Gln Gln Gln Asp
860 865 870

Gly Asp Asn Gly Asp Ser Ser Lys Ser Thr Glu Thr Ser Asp Phe
875 880 885

Glu Asn Ile Glu Ser Pro Leu Asn Glu Arg Asp Ser Ser Ala Ser
890 895 900

Val Asp Asn Arg Glu Leu Glu Gln His Ile Gln Thr Ser Asp Pro
905 910 915

Glu Asn Phe Gln Ser Glu Glu Arg Ser Asp Ser Asp Val Asn Asn
920 925 930

Asp Arg Ser Thr Ser Ser Val Asp Ser Asp Ile Leu Ser Ser Ser
935 940 945

His Ser Ser Asp Thr Leu Cys Asn Ala Asp Asn Ala Gln Ile Pro
950 955 960

Leu Ala Asn Gly Leu Asp Ser His Ser Ile Thr Ser Ser Arg Arg
965 970 975

Thr Lys Ala Asn Glu Gly Lys Lys Glu Thr Trp Asp Thr Ala Glu
980 985 990

Glu Asp Ser Gly Thr Asp Ser Glu Tyr Asp Glu Ser Gly Lys Ser
995 1000 1005

Arg Gly Glu Met Gln Tyr Met Tyr Phe Lys Ala Glu Pro Tyr Ala
1010 1015 1020

Ala Asp Glu Gly Ser Gly Glu Gly His Lys Trp Leu Met Val His
1025 1030 1035

Val Asp Lys Arg Ile Thr Leu Ala Ala Phe Lys Gln His Leu Glu
1040 1045 1050

Pro Phe Val Gly Val Leu Ser Ser His Phe Lys Val Phe Arg Val
1055 1060 1065

Tyr Ala Ser Asn Gln Glu Phe Glu Ser Val Arg Leu Asn Glu Thr
1070 1075 1080

Leu Ser Ser Phe Ser Asp Asp Asn Lys Ile Thr Ile Arg Leu Gly
1085 1090 1095

Arg Ala Leu Lys Lys Gly Glu Tyr Arg Val Lys Val Tyr Gln Leu
1100 1105 1110

Leu Val Asn Glu Gln Glu Pro Cys Lys Phe Leu Leu Asp Ala Val
1115 1120 1125

Phe Ala Lys Gly Met Thr Val Arg Gln Ser Lys Glu Glu Leu Ile
1130 1135 1140

Pro Gln Leu Arg Glu Gln Cys Gly Leu Glu Leu Ser Ile Asp Arg
1145 1150 1155

Phe Arg Leu Arg Lys Lys Thr Trp Lys Asn Pro Gly Thr Val Phe
1160 1165 1170

Leu Asp Tyr His Ile Tyr Glu Glu Asp Ile Asn Ile Ser Ser Asn
1175 1180 1185

Trp Glu Val Phe Leu Glu Val Leu Asp Gly Val Glu Lys Met Lys
1190 1195 1200

Ser Met Ser Gln Leu Ala Val Leu Ser Arg Arg Trp Lys Pro Ser
1205 1210 1215

Glu Met Lys Leu Asp Pro Phe Gln Glu Val Val Leu Glu Ser Ser
1220 1225 1230

Ser Val Asp Glu Leu Arg Glu Lys Leu Ser Glu Ile Ser Gly Ile
1235 1240 1245

Pro Leu Asp Asp Ile Glu Phe Ala Lys Gly Arg Gly Thr Phe Pro
1250 1255 1260

Cys Asp Ile Ser Val Leu Asp Ile His Gln Asp Leu Asp Trp Asn
1265 1270 1275

Pro Lys Val Ser Thr Leu Asn Val Trp Pro Leu Tyr Ile Cys Asp
1280 1285 1290

Asp Gly Ala Val Ile Phe Tyr Arg Asp Lys Thr Glu Glu Leu Met
1295 1300 1305

Glu Ile Asp Arg

Met Asp Ala Leu Asp Ala Ser Lys Leu Leu Asp Glu Glu Leu Tyr
1 5 10 15

Ser Arg Gln Leu Tyr Val Leu Gly Ser Pro Ala Met Gln Arg Ile
20 25 30

Gln Gly Ala Arg Val Leu Val Ser Gly Leu Gln Gly Leu Gly Ala
35 40 45

Glu Val Ala Lys Asn Leu Val Leu Met Gly Val Gly Ser Leu Thr
50 55 60

Leu His Asp Pro His Pro Thr Cys Trp Ser Asp Leu Ala Ala Gln
65 70 75

Phe Leu Leu Ser Glu Gln Asp Leu Glu Arg Ser Arg Ala Glu Ala
80 85 90

Ser Gln Glu Leu Leu Ala Gln Leu Asn Arg Ala Val Gln Val Val
95 100 105

Val His Thr Gly Asp Ile Thr Glu Asp Leu Leu Leu Asp Phe Gln
110 115 120

Val Val Val Leu Thr Ala Ala Lys Leu Glu Glu Gln Leu Lys Val
125 130 135

Gly Thr Leu Cys His Lys His Gly Val Cys Phe Leu Ala Ala Asp
140 145 150

Thr Arg Gly Leu Val Gly Gln Leu Phe Cys Asp Phe Gly Glu Asp
155 160 165

Phe Thr Val Gln Asp Pro Thr Glu Ala Glu Pro Leu Thr Ala Ala
Ile


Gin 


His 


Lys 


Gly 


Ala 


Phe 


Ser 


Gly 

^ J; 


A2rg 


SejT 


He 


Thr 


Thir 


Thr 


Val 


Lys 


Arg 


LeuL 


Leu 


Gin 


His 


Ala 


His 


Gin 


His 


Leu 


Ala 


Glu 


Thr 




Tlur 


Glu 


Val 


Arg" 


Thr 


Val 


Ala 


Met 


He 


Ser 


Arg 


Ala 


Leu 


Asp 


Glu 


Asp 


Cys 


\7Pil 

V CLJ. 




\j My 


Levi 


Leu 


Val 


Phe 


Ala 


Leu 


Val 


Val 


Asp 


Phe 


Leu 


Phe 


Ala 


Ala 


Ala 


PiTO 


Leu 


Thr 


Asp 


Asn 


Phe 


Se3r 


Phe 


Gin 


Leu 


Lys 


Pro 


Ala 


Thr 


Val 


Ala 


Ser 


Ala 


Val 


Arg 


Tyr 


Arg 


His 


Glu 


Asn 


His 


His 





170 




He 


Ser 


Gin 




185 




Asn 


Thr 


His 




200 




He 


Glu 


Gly 




215 




His 


Val 


Arg 




230 




Phe 


Ser 


Arg 




245 




Pro 


Lys 


Thr 




260 




Pro 


His 


Val 




275 




Cys 


Leu 


His 




290 




His 


Gly 


Arg 




305 




Val 


Val 


Gly 




320 




Glu 


Glu 


Pro 




335 




Val 


Ala 


Leu 




350 




Leu 


Gly 


Ala 




365 




Lys 


Phe 


Met 




380 




Cys 


Leu 


Pro 




395 




Ala 


Leu 


Arg 




410 




Ala 


Gly 


Phe 




425 




Gly 


Ala 


Gly 




440 




Val 


Gly 


Leu 




455 




Met 


Asp 


His 




470 




Arg 


Ser 


Gin 




485 




Ala 


Ala 


Arg 




500 




Tyr 


Pro 


Leu 




515 




Phe 


Ser 


Arg 




530 




Ala 


Arg 


Arg 




545 




Leu 


Leu 


Glu 




560 




Phe 


Met 


Pro 




575 




Ala 


Ala 


Ser 




590 




Phe 


Pro 


Ser 




605 




Phe 


Glu 


Glu 




620 




Gin 


Gin 


Ala 




635 





Gly 


Ser 


Pro 


Tyr 


Phe 


Arg 


Met 


Val 


Glu 


Glu 


Asp 


Glv 


Tvr 


Leu 


Arg 


Val 


Arg 


His 


Val 


Ala 


Gin 


Gin 


Ala 


Phe 


Pro 


Pro 


Gin 


Leu 


Ala 


Arg 


Leu 


Glu 


Glu 


Ser 


Ser 


Ala 


Val 


Ala 


Ala 


Pro 


Leu 


Asp 


Glu 


Asp 


Glv 


Gly 


Ser 




Gin 


Glu 


Lys 


Ala 


He 


Gly 


Gly 


Ala 


Gly 


He 


Glu 


Arg 




Val 




Gly 


Leu 


Asn 


Asp 


Pro 


Thr 


Val 


Asp 


Glv 


Tvr 


Val 


Ala 


Ala 


Glv 


Thr 


His 


Val 


Thr 


Glu 


Asp 


Ala 


Thr 


Ala 


Glu 


Leu 


Phe 


Arg 


His 


Thr 


Ser 



175 






Gly 


He 


Leu 


190 






Asp 


Gly 


Asp 


205 






Leu 


Asn 


Asp 


220 






Ser 


Leu 


Glu 


235 






Gly 


Gly 


Ala 


250 






Lys 


Ser 


Leu 


265 






Ser 


Ser 


Gin 


280 






Cys 


Ala 


Leu 


295 






Pro 


Trp 


Asp 


310 






Asp 


Leu 


Glu 


325 






Pro 


Leu 


Asp 


340 






Gly 


Val 


Leu 


355 






Gin 


Glu 


Val 


370 






Gin 


Trp 


Leu 


385 






Glu 


Leu 


Leu 


400 






Tyr 


Asp 


Gly 


415 






Leu 


Arg 


Arg 


430 






Cys 


Glu 


Leu 


445 






Asn 


Ser 


Gly 


460 






Ser 


Asn 


Leu 


475 






Arg 


Pro 


Lys 


490 






Pro 


Asp 


Leu 


505 






Thr 


Glu 


His 


520 






Val 


Ala 


Ala 


535 






Ala 


Arg 


Cys 


550 






Ser 


Gly 


Thr 


565 






Glu 


Ala 


Tyr 


580 






Pro 


Tyr 


Pro 


595 






His 


Thr 


Leu 


610 






Leu 


Ser 


Ala 


625 






Leu 


Ala 


Asp 


640 











180 


Thr 


Leu 


Arg 






195 


Leu 


Val 


Thr 






210 


Cys 


Asp 


Pro 






225 


He 


Gly 


Asp 






240 


He 


Thr 


Glu 






255 


Asp 


Thr 


Ala 






270 


Glu 


Val 


His 






285 


His 


Lys 


Phe 






300 


Pro 


Val 


Asp 






315 


Pro 


Leu 


Lys 






330 


Glu 


Ala 


Leu 






345 


Ser 


Prd 


Met 






360 


Leu 


Lys 


Ala 






375 


Tyr 


Phe 


Asp 






390 


Pro 


Ser 


Pro 






405 


Gin 


He 


Ala 






420 


Gin 


His 


Tyr 






435 


Leu 


Lys 


Val 






450 


Gly 


Leu 


Thr 






465 


Ser 


Arg 


Gin 






480 


Ala 


Glu 


Val 






495 


Gin 


Val 


He 






510 


He 


Tyr 


Gly 






525 


Ala 


Leu 


Asp 






540 


Thr 


His 


Tyr 






555 


Trp 


Gly 


Ser 






570 


Arg 


Ala 


Pro 






585 


Val 


Cys 


Thr 






600 


Gin 


Trp 


Ala 






615 


Glu 


Thr 


He 






630 


Met 


Asp 


Glu 






645 

Pro Gln Thr Leu Thr Leu Leu Lys Pro Val Leu Gly Val Leu Arg
650 655 660

Val Arg Pro Gln Asn Trp Gln Asp Cys Val Ala Trp Ala Leu Gly
665 670 675

His Trp Lys Leu Cys Phe His Tyr Gly Ile Lys Gln Leu Leu Arg
680 685 690

His Phe Pro Pro Asn Lys Asp Thr His Leu Leu Tyr Val Leu Ala
695 700 705

Ala Ala Asn Leu Tyr Ala Gln Met His Gly Leu Pro Gly Ser Gln
710 715 720

Asp Trp Thr Ala Leu Arg Glu Leu Leu Lys Leu Leu Pro Gln Pro
725 730 735

Asp Pro Gln Gln Met Ala Pro Ile Phe Ala Ser Asn Leu Glu Leu
740 745 750

Ala Ser Ala Ser Ala Glu Phe Gly Pro Glu Gln Gln Lys Glu Leu
755 760 765

Asn Lys Ala Leu Glu Val Trp Ser Val Gly Pro Pro Leu Lys Pro
770 775 780

Leu Met Phe Glu Lys Asp Asp Asp Ser Asn Phe His Val Asp Phe
785 790 795

Val Val Ala Ala Ala Ser Leu Arg Cys Gln Asn Tyr Gly Ile Pro
800 805 810

Pro Val Asn Arg Ala Gln Ser Lys Arg Ile Val Gly Gln Ile Ile
815 820 825

Pro Ala Ile Ala Thr Thr Thr Ala Ala Val Ala Gly Leu Leu Gly
830 835 840

Leu Glu Leu Tyr Lys Val Val Ser Gly Pro Arg Pro Arg Ser Ala
845 850 855

Phe Arg His Ser Tyr Leu His Leu Ala Glu Asn Tyr Leu Ile Arg
860 865 870

Tyr Met Pro Phe Ala Pro Ala Ile Gln Thr Phe His His Leu Lys
875 880 885

Trp Thr Ser Trp Asp Arg Leu Lys Val Pro Ala Gly Gln Pro Glu
890 895 900

Arg Thr Leu Glu Ser Leu Leu Ala His Leu Gln Glu Gln His Gly
905 910 915

Leu Arg Val Arg Ile Leu Leu His Gly Ser Ala Leu Leu Tyr Ala
920 925 930

Ala Gly Trp Ser Pro Glu Lys Gln Ala Gln His Leu Pro Leu Arg
935 940 945

Val Thr Glu Leu Val Gln Gln Leu Thr Gly Gln Ala Pro Ala Pro
950 955 960

Gly Gln Arg Val Leu Val Leu Glu Leu Ser Cys Glu Gly Asp Asp
965 970 975

Glu Asp Thr Ala Phe Pro Pro Leu His Tyr Glu Leu
980 985

Met Asn Leu Leu Leu Ile Leu Thr Phe Val Ala Ala Ala Val Ala
1 5 10 15

Ala Pro Phe Asp Asp Asp Asp Lys Ile Val Gly Gly Tyr Ile Cys
20 25 30

Glu Glu Asn Ser Val Pro Tyr Gln Val Ser Leu Asn Ser Gly Tyr
35 40 45

Lys Ser Arg Ile Gln Val Arg Leu Gly Glu His Asn Ile Glu Val
50 55 60

Leu Glu Gly Asn Glu Gln Phe Ile Asn Ala Ala Lys Ile Ile Arg
65 70 75

His Pro Lys Tyr Asn Ser Arg Thr Leu Asp Asn Asp Ile Leu Leu
80 85 90

Ile Lys Leu Ser Ser Pro Ala Val Ile Asn Ser Arg Val Ser Ala
95 100 105

Ile Ser Leu Pro Thr Ala Pro Pro Ala Ala Gly Thr Glu Ser Leu
110 115 120

Ile Ser Gly Trp Gly Asn Thr Leu Ser Ser Gly Ala Asp Tyr Pro
125 130 135

Asp Glu Leu Gln Cys Leu Asp Ala Pro Val Leu Ser Gln Ala Glu
140 145 150

Cys Glu Ala Ser Tyr Pro Gly Lys Ile Thr Asn Asn Met Phe Cys
155 160 165

Val Gly Phe Leu Glu Gly Gly Lys Asp Ser Cys Gln Gly Asp Ser
170 175 180

Gly Gly Pro Val Val Ser Asn Gly Glu Leu Gln Gly Ile Val Ser
185 190 195

Trp Gly Tyr Gly Cys Ala Gln Lys Asn Arg Pro Gly Val Tyr Thr
200 205 210

Lys Val Tyr Asn Tyr Val Asp Trp Ile Lys Asp Thr Ile Ala Ala
215 220 225

Asn Ser


<210> 5 

<211> 727 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7506021CD1 

<400> 5 



Met Lys Lys Gln Arg Lys Ile Leu Trp Arg Lys Gly Ile His Leu
1 5 10 15

Ala Phe Ser Glu Lys Trp Asn Thr Gly Phe Gly Gly Phe Lys Lys
20 25 30

Phe Tyr Phe His Gln His Leu Cys Ile Leu Lys Ala Lys Leu Gly
35 40 45

Arg Pro Val Thr Trp Asn Arg Gln Leu Arg His Phe Gln Gly Arg
50 55 60

Lys Lys Ala Leu Gln Ile Gln Lys Thr Trp Ile Lys Asp Glu Pro
65 70 75

Leu Cys Ala Lys Thr Lys Phe Asn Val Ala Thr Gln Asn Val Ser
80 85 90

Thr Leu Ser Ser Lys Val Lys Arg Lys Asp Ala Lys His Phe Ile
95 100 105

Ser Ser Ser Lys Thr Leu Leu Arg Leu Gln Ala Glu Lys Leu Leu
110 115 120

Ser Ser Ala Lys Asn Ser Asp His Glu Tyr Cys Arg Glu Lys Asn
125 130 135

Leu Leu Lys Ala Val Thr Asp Phe Pro Ser Asn Ser Ala Leu Gly
140 145 150

Gln Ala Asn Gly His Arg Pro Arg Thr Asp Pro Gln Pro Ser Asp
155 160 165

Glu Phe Pro Met Lys Phe Asn Gly Glu Ser Gln Ser Pro Val Ser
170 175 180

Gly Thr Ile Val Val Thr Leu Asn Asn His Lys Arg Lys Gly Phe
185 190 195
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Cys Tyr Arg 
Pro Leu lie 
Lys Leu Ser 
Phe Arg Tyr 
Lys Val Cys 
Val Thr Gly 
Gly Ser Cys 
Cys Arg His 
Val Lys Gly 
Ser Ser Phe 
Asp Gin Gin 
Cys Ser Ser 
Pro Leu Pro 
Glu Arg Glu 
Val Ser Asp 
Ser Val Ser 
Asn Glu Asn 
Ser lie Leu 
Pro Leu Glu 
Gin Val Gly 
Asp Glu Gin 
Met Lys Lys 
Leu Gly Arg 
Lys Pro Phe 
Gin Lys Cys 
Met Asp Asp 
Gin Val He 
Asp Lys Val 
Thr Lys Gly 
Leu Phe Lys 
His Trp Ser 
Phe Tyr Asp 



Cys 


Cys 


Gin 




200 




Pro 


Lys 


Lys 




215 




Pro 


Leu 


Met 




230 




Arg 


He 


Leu 




245 




Lys 


Leu 


Arg 




260 




Asp 


His 


Gin 




275 




Ser 


Pro 


Phe 




290 




Gin 


Pro 


Tyr 




305 




Thr 


Asn 


Ser 




320 




Leu 


Gly 


Lys 




335 




Asn 


Gly 


Ser 




350 




Pro 


Lys 


Trp 




365 




Glu 


His 


Arg 




380 




He 


Met 


Thr 




395 




Asp 


Arg 


Val 




410 




Ser 


Val 


Asp 




425 




Ser 


Tyr 


Gin 




440 




Ser 


Ser 


Glu 




455 




Ala 


Pro 


Leu 




470 




Gly 


Gly 


Lys 




485 




Leu 


Ser 


Val 




500 




Tyr 


Gly 


Ser 




515 




Leu 


Lys 


Asp 




530 




He 


Asn 


Arg 




545 




Asn 


Phe 


Arg 




560 




Leu 


Ala 


Thr 




575 




Asn 


Met 


Tyr 




590 




His 


Phe 


Phe 




605 




Tyr 


Asn 


Gly 




620 




Lys 


Ser 


Leu 




635 




Leu 


He 


Thr 




650 




Ser 


Gin 


Gly 



Gly Pro Glu 
Phe Gin Leu 
Met Tyr Glu 
Arg Ser Gin 
Lys Ala Gin 
Glu Thr Arg 
Pro Ser Pro 
Phe Pro Asp 
His Val Pro 
Glu Leu Ser 
Ala Thr Asn 
Glu Cys Thr 
Ser Asn Thr 
Leu Gly Gin 
Lys Leu Ser 
Gly Pro Val 
Met Glu Glu 
Leu Leu Asp 
Val Cys Ser 
Asn Ser Gin 
Cys Leu Ser 
Leu Val Pro 
Val Phe Asn 
Glu He Thr 
He Phe Tyr 
Leu Asp Gly 
Gly Glu Leu 
Asn Ser Phe 
Val Lys Arg 
Leu Leu He 
Val Thr Leu 
He His Phe 



His 


His 


Arg 


205 






Asn 


Gin 


His 


220 






Lys 


Leu 


Ser 


235 






His 


Phe 


Arg 


250 






Arg 


Ser 


Trp 


265 






Arg 


Glu 


Asn 


280 






Glu 


Pro 


Lys 


295 






Met 


Asp 


Ser 


310 






Asp 


Cys 


His 


325 






Leu 


Asp 


Glu 


340 






Ala 


Trp 


Asp 


355 






Glu 


Leu 


He 


370 






Met 


Phe 


He 


385 






Glu 


Asn 


Gin 


400 






Val 


Ser 


Gly 


415 






Ser 


Gin 


Lys 


430 






Asp 


Gly 


Ser 


445 






His 


Pro 


Tyr 


460 






Gly 


Leu 


Lys 


475 






Lys 


Ala 


Ser 


490






Gly 


Phe 


Leu 


505 






Leu 


Ser 


Glu 


520 






Glu 


Asp 


Phe 


535 






Asn 


Tyr 


Arg 


550 






Asn 


Lys 


His 


565 






Gln


Asn 


Trp 


580 






Ile


Met 


Asp 


595 






Phe 


His 


Arg 


610 






Trp 


Thr 


Lys 


625 






Pro 


Ile


His 


640 






Ser 


Asn 


Arg 


655 






Lys 


Phe 


Cys 



Asn 


Gly 


Gly 






210 


Arg 


Arg 


Ile






225 


Met 


Ile


Arg 






240 


Thr 


Lys 


Ser 






255 


Val 


Gln


Lys 






270 


Gly 


Glu 


Gly 






285 


Asp 


Pro 


Ser 






300 


Ser 


Ala 


Val 






315 


Thr 


Lys 


Gly 






330 


Ala 


Phe 


Pro 






345 


Gln


Ser 


Ser 






360 


His 


Asp 


Ile






375 


Ser 


Glu 


Thr 






390 


Thr 


Ser 


Ser 






405 


Ala 


Asp 


Thr 






420 


Ala 


Val 


Gln






435 


Leu 


Lys 


Gln






450 


Cys 


Lys 


Ser 






465 


Leu 


Glu 


Asn 






480 


Pro 


Val 


Asp 






495 


Asp 


Glu 


Val 






510 


Lys 


Glu 


Val 






525 


Ser 


Asn 


Arg 






540 


Ala 


Arg 


His 






555 


Met 


Leu 


Asp 






570 


Leu 


Asn 


Asp 






585 


Ala 


Val 


Pro 






600 


Gln


Leu 


Val 






615 


Lys 


Val 


Asp 






630 


Leu 


Glu 


Val 






645 


Ile


Ile


Ser 






660 


Val 


Glu 


Cys 
665










670 








675 


Ile Pro Gln Gln


Lys 
680 


Asn 


Asp 


Ser 


Asp 


Cys 
685 


Gly Val 


Phe 


Val 


Leu 
690 


Gln Tyr Cys Lys


Cys 


Leu 


Ala 


Leu 


Glu 


Gln


Pro Phe 


Gln


Phe 


Ser 


695 










700 








705 


Gln Glu Asp Met


Pro 
710 


Arg 


Val 


Arg 


Lys 


Arg 
715 


Ile Tyr


Lys 


Glu 


Leu 
720 


Cys Glu Cys Arg 


Leu 
725 


Met 


Asp 

















<210> 6 
<211> 143 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7503356CD1 

<400> 6 



Met 


Ser 


Leu 


Trp 


Pro 


Pro 


Phe 


Arg 


Cys 


Arg 


Trp 


Lys 


Leu 


Ala 


Pro 


1 








5 










10 










15 


Arg 


Tyr 


Ser 


Arg 


Arg 


Ala 


Ser 


Pro 


Gln


Gln


Pro 


Gln


Gln


Asp 


Phe 








20 










25 










30 


Glu 


Ala 


Leu 


Leu 


Ala 


Glu 


Cys 


Leu 


Arg 


Asn 


Gly Cys 


Leu 


Phe 


Glu 










35 










40 










45 


Asp 


Thr 


Ser 


Phe 


Pro 


Ala 


Thr 


Leu 


Ser 


Ser 


Ile


Gly 


Ser 


Gly 


Ser 








50 










55 










60 


Leu 


Leu 


Gln


Lys 


Leu 


Pro 


Pro 


Arg 


Leu 


Gln


Trp 


Lys 


Arg 


Pro 


Pro 










65 










70 










75 


Glu 


Leu 


His 


Ser 


Asn 


Pro 


Gln


Phe 


Tyr 


Phe 


Ala 


Lys 


Ala 


Lys 


Arg 










80 










85 










90 


Leu 


Asp 


Leu 


Cys 


Gln


Gly 


Ile


Val 


Gly Asp 


Cys 


Trp


Phe 


Leu 


Ala 










95 










100 










105 


Ala 


Leu 


Gln


Ala 


Leu 


Ala 


Leu 


His 


Gln


Asp 


Ile


Leu 


Ser Arg Val 










110 










115 










120 


Val 


Pro 


Leu 


Asn 


Gln


Ser 


Phe 


Thr 


Glu 


Lys 


Tyr 


Ala 


Gly 


Ile


Phe 










125 










130 










135 


Arg 


Phe 


Trp 


Ala 


Leu 


Trp 


Phe 


Leu 

















140 

<210> 7 
<211> 31 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7509052CD1 

<400> 7 

Met Val Asn Pro Thr Val Phe Phe Asp Ile Ala Val Asp Gly Glu
1 5 10 15

Pro Leu Gly Arg Val Ser Phe Glu Val Arg Gly Leu Asp Thr Lys 
20 25 30 

Lys 



<210> 8 
<211> 630 
<212> PRT 

<213> Homo sapiens 
<220>

<221> misc_feature

<223> Incyte ID No: 7503366CD1 

<400> 8 



Met 


Thr 


Ala 


Glu 


Leu 


Gln


Gln


Asp 


1 








5 








Gly His 


Gly 


Ser 


Ser 


Cys 


Gln


Met 










20 








Ile


Thr 


Gly 


Ile


Gln
35 


Asp 


Pro 


Ser 


Ala 


Ser 


Asn 


Gly 


Asp 
50 


Ile


Thr 


Gln


Glu 


Arg 


Val 


Lys 


Glu 
65 


Pro 


Ser 


Gln


Ser 


Glu 


Val 


Glu 


Gly 
80 


Ser 


Ala 


Ala 


Val 


Ile


Asp 


Leu 


Thr 
95 


His 


Asp 


Asn 


Ile


Ala 


Leu 


Ser 


Leu 
110 


Leu 


Glu 


Ser 


Arg 


Asp 


Leu 


Asn 


Arg 
125 


Met 


His 


Glu 


Arg 


Ser 


Lys 


Arg 


Lys 
140 


Arg 


Cys 


Glu 


Pro 


Asn 


Asp 


Trp 


Arg 
155 


Arg 


Val 


Asp 


Asn 


Val 


Gly 


Asn 


Thr 
170 


Cys 


Trp 


Phe 


Phe 


Gln


Leu 


Pro 


Glu 
185 


Phe 


Arg 


Arg 


Pro 


Gln


Asn 


Val 


Leu 
200 


Glu 


Asn 


Cys 


Asn 


Ile


Met 


Phe 


Met 
215 


Gln


Glu 


Leu 


Met 


Gly 


Ser 


Asn 


Arg 
230 


Lys 


Phe 


Val 


Leu 


Leu 


Lys 


Gly 


Ala 
245 


Phe 


Arg 


Ser 


Val 


Ser 


Glu 


Phe 


Thr 
260 


His 


Lys 


Leu 


Phe 


Gln


Leu 


Ala 


Val 
275 


Asn 


Val 


Asn 


Glu 


Asn 


Pro 


Met 


Val 
290 


Gln


Leu 


Phe 


Gly Val 


Arg 


Glu 


Gly 


Lys 


Pro 


Phe 










305 








Gln


Tyr 


Pro 


Leu 


Gln
320 


Val 


Asn 


Gly 


Leu 


Glu 


Gly 


Ala 


Met 
335 


Val 


Glu 


Gly 


Asp 


His 


Ser 


Val 


Lys 
350 


Tyr 


Gly 


Gln


Pro 


Pro 


Val 


Leu 


Thr 
365 


Phe 


Glu 


Leu 


Ser 


Leu 


Gly 


Gln


Pro 
380 


Glu 


Lys 


Ile


Gln


Ile


Ile


Tyr 


Met 
395 


Asp 


Arg 


Tyr 


Ile


Arg 


Asn 


Lys 


Arg 

410 


Glu 


Cys 


Ile


Lys 


Ile


Leu 


Gln


Gln


Lys 


Leu 


Glu 



425 



Asp 


Ala 
10 


Ala 


Gly 


Ala 


Ala 


Asp 
15 


Leu 


Leu 
25 


Asn 


Gln


Leu 


Arg 


Glu 
30 


Phe 


Leu 
40 


His 


Glu 


Ala 


Leu 


Lys 
45 


Ala 


Val 
55 


Ser 


Leu 


Leu 


Thr 


Asp 
60 


Asp 


Thr 
70 


Val 


Ala 


Thr 


Glu 


Pro 
75 


Asn 


Lys 
85 


Glu 


Val 


Leu 


Ala 


Lys 
90 


Lys 


Asp 
100 


Asp 


Leu 


Gln


Ala 


Ala 
105 


Pro 


Lys 
115 


Ile


Gln


Ala 


Asp 


Gly 
120 


Ala 


Thr 
130 


Ser 


Ala 


Glu 


Thr 


Lys 
135 


Val 


Trp 
145 


Gly 


Glu 


Asn 


Pro 


Asn 
150 


Gly 


Trp 
160 


Pro 


Val 


Gly 


Leu 


Lys 
165 


Ser 


Ala 

175 


Val 


Ile


Gln


Ser 


Leu 

180 


Leu 


Val 
190 


Leu 


Ser 


Tyr 


Ser 


Leu 
195 


Arg 


Ser 
205 


His 


Thr 


Glu 


Lys 


Arg 
210 


Gln


Tyr 
220 


Leu 


Phe 


Ala 


Leu 


Met 
225 


Asp 


Pro 
235 


Ser 


Ala 


Ala 


Leu 


Asp 
240 


Ser 


Glu 
250 


Glu 


Gln


Gln


Gln


Asp 
255 


Leu 


Asp 
265 


Trp 


Leu 


Glu 


Asp 


Ala 
270 


Ser 


Ser 
280 


Pro 


Arg 


Asn 


Lys 


Ser 
285 


Tyr 


Gly 
295 


Thr 


Phe 


Leu 


Thr 


Glu 
300 


Cys 


Asn 
310 


Asn 


Glu 


Thr 


Phe 


Gly 
315 


Tyr 


Arg 
325 


Asn 


Leu 


Asp 


Glu 


Cys 
330 


Asp 


Val 
340 


Glu 


Leu 


Leu 


Pro 


Ser 
345 


Glu 


Arg 
355 


Trp 


Phe 


Thr 


Lys 


Leu 
360 


Ser 


Arg 
370 


Phe 


Glu 


Phe 


Asn 


Gln
375 


His 


Asn 
385 


Lys 


Leu 


Glu 


Phe 


Pro 
390 


Met 


Tyr 
400 


Arg 


Ser 


Lys 


Glu 


Leu 
405 


Arg 


Lys 


Leu 


Lys 


Glu 


Glu 


Ile


415 










420 


Arg 


Tyr 


Val 


Lys 


Tyr 


Gly 


Ser 


430 










435 
Gly


Pro 


Ala 


Arg 


Phe 


Pro 


Leu 


Pro Asp 


Met 


Leu 


Lys 


Tyr 


Val 


Ile










440 










445 










450 


Glu 


Phe 


Ala 


Ser 


Thr 
455 


Lys 


Pro 


Ala 


Ser 


Glu 
460 


Ser 


Cys 


Pro 


Pro 


Glu 
465 


Ser 


Asp 


Thr 


His 


Met 
470 


Thr 


Leu 


Pro 


Leu 


Ser 
475 


Ser 


Val


His 


Cys 


Ser 
480 


Val 


Ser 


Asp 


Gln


Thr 

485 


Ser 


Lys 


Glu 


Ser 


Thr 
490 


Ser 


Thr 


Glu 


Ser 


Ser 
495 


Ser 


Gln


Asp 


Val 


Glu 
500 


Ser 


Thr 


Phe 


Ser 


Ser 
505 


Pro 


Glu 


Asp 


Ser 


Leu 
510 


Pro 


Lys 


Ser 


Lys 


Pro 
515 


Leu 


Thr 


Ser 


Ser 


Arg 
520 


Ser 


Ser 


Met 


Glu 


Met 
525 


Pro 


Ser 


Gln


Pro 


Ala 
530 


Pro 


Arg 


Thr 


Val 


Thr 
535 


Asp 


Glu 


Glu 


Ile


Asn 
540 


Phe 


Val 


Lys 


Thr 


Cys 
545 


Leu 


Gln


Arg 


Trp 


Arg 
550 


Ser 


Glu 


Ile


Glu 


Gln
555 


Asp 


Ile


Gln


Asp 


Leu 
560 


Lys 


Thr 


Cys 


Ile


Ala 
565 


Ser 


Thr 


Thr 


Gln


Thr 
570 


Ile


Glu 


Gln


Met 


Tyr 
575 


Cys 


Asp 


Pro 


Leu 


Leu 
580 


Arg 


Gln


Phe 


Leu 


Phe 
585 


Met 


Lys 


Asp 


Lys 


Gln
590 


Met 


Leu 


Asp 


Thr 


Ile
595 


Gly 


Pro 


Ile


Ser 


Ile
600 


Ile


Asn 


Pro 


Asp 


Arg 
605 


Ala 


Gly 


Ser 


Ser 


Thr 
610 


Met 


Thr 


Ser 


Leu 


Leu 
615 


Leu 


Asn 


Leu 


Pro 


Gly 
620 


Lys 


Lys 


Leu 


Lys 


Glu 
625 


Ile


Pro 


Met 


Glu 


Ala 
630 



<210> 9 

<211> 354 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7505933CD1 

<400> 9 



Met 


Val 


Leu 


Glu 


Ser 


Thr 


Met 


Val 


Cys 


Val 


Asp 


Asn 


Ser 


Glu 


Tyr 


1 








5 










10 










15 


Met 


Arg 


Asn 


Gly 


Asp 


Phe 


Leu 


Pro 


Thr 


Arg 


Leu 


Gln


Ala 


Gln


Gln










20 










25 










30 


Asp 


Ala 


Val 


Asn 


Ile


Val 


Cys 


His 


Ser 


Lys 


Thr 


Arg 


Ser 


Asn 


Pro 










35 










40 










45 


Glu 


Asn 


Asn 


Val 


Gly 


Leu 


Ile


Thr 


Leu 


Ala 


Asn 


Asp 


Cys 


Glu 


Val 










50 










55 










60 


Leu 


Thr 


Thr 


Leu 


Thr 


Pro 


Asp 


Thr 


Gly 


Arg 


Ile


Leu 


Ser 


Lys 


Leu 










65 










70 










75 


His 


Thr 


Val 


Gln


Pro 


Lys 


Gly 


Lys 


Ile


Thr 


Phe 


Cys 


Thr 


Gly 


Ile










80 










85 










90 


Arg 


Val 


Ala 


His 


Leu 


Ala 


Leu 


Lys 


His 


Arg 


Gln


Gly 


Lys 


Asn 


His 










95 










100 










105 


Lys 


Met 


Arg 


Ile


Ile


Ala 


Phe 


Val 


Gly 


Ser 


Pro 


Val 


Glu 


Asp 


Asn 










110 










115 










120 


Glu 


Lys 


Asp 


Glu 


Val 


Asn 


Thr 


Glu 


Lys 


Leu 


Thr 


Ala 


Phe 


Val 


Asn 










125 










130 










135 


Thr 


Leu 


Asn 


Gly 


Lys 


Asp 


Gly 


Thr Gly 


Ser 


His 


Leu 


Val 


Thr 


Val 










140 










145 










150 


Pro 


Pro 


Gly 


Pro 


Ser 


Leu 


Ala 


Asp 


Ala 


Leu 


Ile


Ser 


Ser 


Pro 


Ile










155 










160 










165 


Leu 


Ala 


Gly 


Glu 


Gly 


Gly 


Ala 


Met 


Leu 


Gly 


Leu 


Gly 


Ala 


Ser 


Asp 










170 










175 










180 


Phe 


Glu 


Phe 


Gly 


Val 


Asp 


Pro 


Ser 


Ala 


Asp 


Pro 


Glu 


Leu 


Ala 


Leu 
185


Ala 


Leu 


Arg Val 


Ser 










200 


Glu 


Ala 


Arg 


Arg 


Ala 
215 


Ala 


Thr 


Thr 


Gly 


Thr 
230 


Thr 


Ile


Ser 


Gln


Gln
245 


Ser 


Ser 


Met 


Thr 


Glu 
260 


Ser 


Leu 


Gln Gly


Ala 










275 


Asp 


Ala 


Ser 


Ser 


Ala 
290 


Asp 


Asp 


Tyr 


Asp 


Val 
305 


Leu 


Glu 


Asn 


Leu 


Pro 
320 


Asn 


Ala 


Met 


Gly 


Ser 
335 


Lys 


Asp 


Lys 


Lys 


Glu 
350 



<210> 10 
<211> 132 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature
<223> Incyte ID No: 

<400> 10 



Met 


Gln


Arg 


His 


Leu 


1 








5 


Ser 


Arg 


Asn 


Asp 


Gln
20 


Leu 


Ser 


Tyr 


Tyr 


Thr 
35 


Arg 


Ile


Leu 


Ala 


Val 
50 


Arg 


Leu 


Ser 


Ser 


Ile

65 


Lys 


Tyr 


Pro 


Leu 


Pro 
80 


Phe 


Cys 


Thr 


Ser 


Gly 
95 


Ile


Leu 


Gly 


Asn 


Val 
110 


Arg 


Thr 


Asn 


Asn 


Arg 
125 



<210> 11 

<211> 316 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 

<400> 11 











190 


Met 


Glu 


Glu 


Gln


Arg 
205 


Ala 


Ala 


Ala 


Ser 


Ala 
220 


Glu 


Asp 


Ser 


Asp 


Asp 
235 


Glu 


Phe 


Gly 


Arg 


Thr 
250 


Glu 


Glu 


Gln


Ile


Ala 
265 


Glu 


Phe 


Gly 


Gln


Ala 
280 


Met 


Asp 


Thr 


Ser 


Glu 
295 


Met 


Gln


Asp 


Pro 


Glu 
310 


Gly 


Val 


Asp 


Pro 


Asn 
325 


Leu 


Ala 


Ser 


Gln


Ala 
340 


Glu 


Asp 


Lys 


Lys 





7507064CD1 



Val 


Ala 


Gln


Asp 


Leu 
10 


Gly 


Ser 


Met 


Leu 


Thr 
25 


Gly 


Ser 


Leu 


His 


Trp 
40 


His 


Cys 


Gly 


Gln


Phe 
55 


Pro 


Thr 


Ala 


Val 


Phe 

70 


Pro 


Ser 


Ala 


Tyr 


Thr 
85 


Phe 


Gln


Gly 


Asp 


Tyr 
100 


Phe 


Ile


Trp 


Glu 


Tyr 
115 


Val 


Gly 


Leu 


Ala 


Lys 
130 



1439986CD1 











195 


Gln


Arg 


Gln


Glu 


Glu 
210 


Ala 


Glu 


Ala 


Gly 


Ile
225 


Ala 


Leu 


Leu 


Lys 


Met 
240 


Gly 


Leu 


Pro 


Asp 


Leu 
255 


Tyr 


Ala 


Met 


Gln


Met 
270 


Glu 


Ser 


Ala 


Asp 


Ile
285 


Pro 


Ala 


Lys 


Glu 


Glu 
300 


Phe 


Leu 


Gln


Ser 


Val 
315 


Asn 


Glu 


Ala 


Ile


Arg 
330 


Thr 


Lys 


Asp 


Gly 


Lys 
345 



Phe 


Ser 


Val 


Tyr 


Met 
15 


Leu 


Arg 


Ala 


Ile


Asp 
30 


Ile


Pro 


Met 


Thr 


Ala 
45 


Asp 


Ile


Asp 


Cys 


Gly 
60 


Glu 


Ile


His 


Gly 


Lys 
75 


Ser 


Gln


Asp 


Gln


Gly 
90 


Ser 


Ser 


Gln


Gln


Trp 
105 


Tyr 


Ser 


Val 


Phe 


Asp 
120 


Ala 


Val 
Met


Gly 


Gln


Asp 


Tyr 


Tyr 


Ser Val


Leu Gly 


Ile


Thr 


Arg 


Asn 


Ser 


1 






5 








10 










15 


Glu 


Asp 


Ala 


Gln


Ile


Lys 


Gln Ala


Tyr Arg 


Arg 


Leu 


Ala 


Leu 


Lys 








20 








25 










30 


His 


His 


Pro 


Leu 


Lys 


Ser 


Asn Glu 


Pro 


Ser 


Ser 


Ala 


Glu 


Ile


Phe 










35 








40 










45 


Arg 


Gln


Ile


Ala 


Glu 


Ala 


Tyr Asp 


Val 


Leu 


Ser 


Asp 


Pro 


Met 


Lys 








50 








55 










60 


Arg 


Gly 


Ile


Tyr 


Asp 


Lys 


Phe Gly Glu Glu Gly Leu Lys 


Gly Gly 






65 








70 










75 


Ile


Pro 


Leu 


Glu 


Phe 


Gly 


Ser Gln


Thr 


Pro 


Trp 


Thr 


Thr 


Gly 


Tyr 










80 








85 










90 


Val 


Phe 


His 


Gly Lys 


Pro 


Glu Lys 


Val 


Phe 


His 


Glu 


Phe 


Phe 


Gly 










95 








100 










105 


Gly Asn Asn 


Pro 


Phe 


Ser 


Glu Phe 


Phe 


Asp 


Ala 


Glu 


Gly 


Ser 


Glu 










110 








115 










120 


vet -L 




Leu 


Asn 


Phe 


Gly 


Gly Leu Gln Gly Arg Gly Val Lys


Lys 








125 








130 










135 


Gln


Asp 


Pro 


Gln


Val 


Glu 


Arg Asp 


Leu 


Tyr 


Leu 


Ser 


Leu 


Glu 


Asp 








140 








145 










150 


Leu 


Phe 


Phe 


Gly Cys 


Thr 


Lys Lys 


Ile


Lys 


Ile


Ser 


Arg 


Arg 


Val 










155 








160 










165 


Leu 


Asn 


Glu Asp Gly Tyr 


Ser Ser 


Thr 


Ile


Lys 


Asp 


Lys 


Ile


Leu 










170 








175 










180 


Thr 


Ile


Asp 


Val 


Lys 


Pro 


Gly Trp 


Arg 


Gln


Gly Thr Arg 


Ile


Thr 








185 








190 










195 


Phe 


Glu 


Lys 


Glu Gly 


Asp 


Gln Gly


Pro 


Asn 


Ile


Ile


Pro 


Ala 


Asp 








200 








205 










210 


Ile


Ile


Phe 


Ile


Val 


Lys 


Glu Lys 


Leu 


His 


Pro 


Arg 


Phe 


Arg 


Arg 










215 






220 










225 


Glu 


Asn 


Asp 


Asn 


Leu 


Phe 


Phe Val 


Asn 


Pro 


Ile


Pro 


Leu 


Gly 


Lys 








230 








235 










240 


Ala 


Leu 


Thr 


Cys 


Cys 


Thr 


Val Glu 


Val 


Arg 


Thr 


Leu 


Asp 


Asp 


Arg 








245 








250 










255 


Leu 


Leu 


Asn 


Ile


Pro 


Ile


Asn Asp 


Ile


Ile


His 


Pro 


Lys 


Tyr 


Phe 










260 






265 










270 


Lys 


Lys 


Val 


Pro 


Gly Glu 


Gly Met 


Pro 


Leu 


Pro 


Glu 


Asp 


Pro 


Thr 






275 








280 










285 


Lys 


Lys 


Gly 


Asp 


Leu 


Phe 


Ile Phe


Phe 


Asp 


Ile


Gln


Phe 


Pro 


Thr 






290 








295 










300 


Arg 


Leu 


Thr 


Pro 


Gln


Lys 


Lys Gln


Met 


Leu 


Arg 


Gln


Ala 


Leu 


Leu 








305 








310 










315 



Thr 



<210> 12 

<211> 531 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2008979CD1 

<400> 12 



Met 


Lys 


Cys 


His 


Tyr 


Glu 


Ala 


Leu 


Gly Val 


Arg 


Arg 


Asp 


Ala 


Ser 


1 




5 








10 










15 


Glu 


Glu 


Glu 


Leu 


Lys 
20 


Lys 


Ala 


Tyr 


Arg Lys 
25 


Leu 


Ala 


Leu 


Lys 


Trp 
30 


His 


Pro 


Asp 


Lys 


Asn 


Leu 


Asp 


Asn 


Ala Ala 


Glu 


Ala 


Ala 


Glu 


Gln








35 








40 










45 


Phe 


Lys 


Leu 


Ile


Gln


Ala 


Ala 


Tyr 


Asp Val 


Leu 


Ser 


Asp 


Pro 


Gln








50 








55 










60 
Glu Arg Ala
Gly Phe Asp 
Tyr Phe Thr 
Gly Phe Tyr 
Glu Glu Leu 
Thr Phe Gly 
Phe Tyr Ala 
Trp Lys Glu 
Lys Arg Ala 
Arg Lys Glu 
Arg Lys Arg 
Glu Gln Asn
Gln Gln Lys
Gln Ser Trp
Met Glu Ala 
Asn Glu Met 
Asp Ser Asp 
Cys Pro Ala 
Asn His Glu 
Lys Gln Gln
Ile Asp Glu
Asp Ala Pro 
Gln Lys Pro
Pro Gly Glu 
Gln Asp Ser
Ser Val Thr 
Ala Lys Ser 
Lys Lys Pro 
Leu Ile Ser
Lys Leu Phe 
Ser Ser Ser 
Glu Lys Arg 



Trp 


Tyr 


Asp 




65




Gly 


Glu 


Tyr 




80




Val


Thr 


Cys 








Thr 


Val 


Tyr 








Glu 


Ser 


Val 




125 




Asp 


Ser 


Gln




140 




Tyr 


Trp 


Gln




155 




Glu 


Tyr 


Asp 




170 




Met 


Glu 


Lys 




185 




Lys 


Asn 


Glu 




200 




Asp 


Lys 


Arg 




215 




Ala 


Glu 


Lys 




230 




Leu 


Lys 


Gln




245 




Met 


Thr 


Met 




260 




Arg 


Tyr 


Glu 




275 




Glu 


Glu 


His 




290 




Glu 


Ala 


Glu 




305




Cys 


Asp 


Lys 




320 




Lys 


Ser 


Lys 




335 




Leu 


Glu 


Glu 




350 




Asn 


Pro 


Leu 




365 




Lys 


Gln


Lys 




380 




Ala 


Gln


Asn 




395 




Gly 


Val 


Lys 




410 




Ala 


Lys 


Glu 




425 




Glu 


Ile


Ile




440 




Val 


Pro 


Lys 




455 




Val 


Arg 


Val 








Cys 


Thr 


Thr 




485 




Asp 


His 


Leu 




500 




Ser 


Leu 


Asn 




515 




Lys 


Asn 


Arg 



Asn 


His 


Arg 


Gln


Asp 


Asp 


Tyr 


Ser 


Gly 


Arg 


Asn 


Val 


Leu 


Glu 


Glu 


Ser 


Asp 


Tyr 


Ser 


Phe 


Cys 


Thr 


Arg 


Gln


Glu 


Asn 


Lys 


Leu 


Val 


Arg 


Val 


Gln


Ala 


Ala 


Arg 


Lys 


Ala 


Lys 


Leu 


Ala 


Asn 


Leu 


Lys 


Glu 


Phe 


Glu 


Leu 


Lys 


Asp 


Ala 


Glu 


Ser 


Phe 


Lys 


Lys 


His 


Arg 


Glu 


Glu 


Glu 


Asp 


Asp 


Asn 


Leu 


Ser 


Lys 


Tyr 


Asp 


Asp 


Val 


Asp 


Pro 


Leu 


Glu 


Asp 


Lys 


Pro 


Cys 


Pro 


Lys 


Gly 


Pro 


Ala 


Glu 


Cys 


His 


Ser 


Lys 


Ala 


Thr 


Ser 


Ala 


Thr 



CsXU 


TV 1 Q 

Ala


Leu 


70






Ser 


Leu 


Asp








Tyr 


Gly 


Asp 


100






Phe


Glu


Met 


115 






Glu 


Val 


Asp 


130 






Asp 


Thr 


Val 


145 






Thr 


Gln


Lys 


160 






Ala 


Ser 


Asn 


175 






Lys 


Ile


Arg 


190 






Gln


Leu 


Val 


205 






His 


Arg 


Lys 


220 






Ala 


Glu 


Glu 


235 






Val 


Glu 


Gln


250 






Glu 


Lys 


Glu 


265 






Gly 


Asp 


Gly 


280 






Asp 


Glu 


Glu 


295 






Leu 


Tyr 


Asp 


310 






Thr 


Glu 


Lys 


325 






Glu 


Met 


Val 


340 






Asn 


Phe 


Ser 


355 






Ser 


Glu 


Glu 


370 






Lys 


Gln


Lys 


385 






Asn 


Phe 


Asn 


400 






Glu 


Asp 


Thr 


415 






Ser 


Pro 


Gln


430






Asp 


Asp 


Pro 


445 






Lys 


Lys 


Thr 


460 






Pro 


Gln


Thr 


475 






Glu 


Phe 


Pro 


490 






Gly 


His 


Ala 


505 






Ser 


Ser 


Gln


520 







Leu 


Lys 


Gly






75


Leu 


Leu 


Arg 






90


Asp


Glu 


Lys 






Ile


Ala


Lys 






120


Asp 


Phe 


Pro 






135


Val 


His 


Pro 






150 


Asn 


Phe 


Ala 






165 


Arg 


Trp 


Glu 






180 


Asp 


Lys 


Ala 






195 


Ala 


Phe 


Ile






210 


Leu 


Val 


Glu 






225 


Met 


Arg 


Arg 






240 


Tyr 


Arg 


Glu 






255 


Leu 


Gln


Glu 






270 


Ser 


Asp 


Glu 






285 


Asp 


Gly 


Lys 






300 


Asp 


Leu 


Tyr 






315 


Ala 


Met 


Lys 






330 


Ala 


Leu 


Leu 






345 


Arg 


Pro 


Gln






360 


Glu 


Met 


Glu 






375 


Lys 


Lys 


Lys 






390 


Val 


Asn 


Gly 






405 


Asn 


Leu 


Asn 






420 


Glu 


Asn 


Val 






435 


Lys 


Ser 


Glu 






450 


Lys 


Asp 


Met 








Met 


Ser 


Val 






480 


Ser 


Arg 


Asn 






495 


Arg 


Ala 


Pro 






510 


Ser 


Lys 


Lys 






525 
530

<210> 13 

<211> 108 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 90073157CD1 

<400> 13 



Met 


Ser 


Arg 


Pro 


Gln


Leu 


Arg 


Arg 


1 








5 








Pro 


Ser 


Gly Val 


Pro 


Gly 


Leu 


Ala 










20 








Leu 


Ala 


Leu 


Arg 


Leu 
35 


Ala 


Ala 


Gly 


Pro 


Glu 


Leu 


Cys 


Arg 
50 


Pro 


Ile


Arg 


Phe 


Val 


Phe 


Asp 


Val 
65 


Gly 


Gln


Lys 


Ser 


Gln


Ile


Thr 


Thr 
80 


Val


Ala 


Thr 


Leu 


Met 


Cys 


Tyr 


Ala 


His 


Ser 


Lys 



95 

Gly Asn Leu 



<210> 14 

<211> 345 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7506782CD1 

<400> 14 



Met 


Ala 


Phe 


Leu 


Gly 


Leu 


Phe 


Ser 


1 








5 








Ala 


Thr 


Gly 


Ala 


Thr 
20 


Phe 


Pro 


Glu 


Val 


Asn 


Met 


Tyr 


Asn 
35 


Arg 


Leu 


Arg 


Ile


Leu 


Phe 


Ser 


Pro 
50 


Leu 


Ser 


Ile


Glu 


Leu 


Gly 


Ala 


Gln
65 


Gly 


Ser 


Thr 


Met 


Gly Tyr 


Asp 


Ser 


Leu 


Lys 


Asn










80 








Lys 


Glu 


Phe 


Ser 


Asn 


Met 


Val 


Thr 








95 








Met 


Lys 


Ile


Ala 


Asn 

110 


Ser 


Leu 


Phe 


Asn 


Glu 


Glu 


Phe 


Leu 
125 


Gln


Met 


Met 


Val 


Asn 


His 


Val 


Asp 
140 


Phe 


Ser 


Gln


Ile


Asn 


Lys 


Trp 


Val 
155 


Glu 


Asn 


Asn 


Gly 


Ser 


Asn 


Glu 


Ala 
170 


Gly 


Gly 


Ile



Trp 


Arg Leu 


Val 


Ser 


Ser 


Pro 


10 








15 


Leu 


Leu Ala 


Leu 


Leu 


Ala 


Leu 




25 








30 


Thr 


Asp Cys 


Pro 


Cys 


Pro 


Glu 




40 








45 


His 


His Pro 


Asp 


Phe 


Glu 


Val 




55 








60 


Thr 


Trp Lys 


Ser 


Tyr 


Asp 


Trp 




70 








75 


Phe 


Gly Lys 


Tyr 


Asp 


Ser 


Glu 




85 








90 


Gly Ala Arg Val Val 


Leu 


Lys 




100 








105 



Leu 


Leu Val 


Leu 


Gln


Ser 


Met 




10 








15 


Glu 


Ala Ile


Ala 


Asp 


Leu 


Ser 




25 








30 


Ala Thr Gly Glu 


Asp Glu 


Asn 




40 








45 


Ala 


Leu Ala 


Met 


Gly Met 


Met 




55 








60 


Gln


Lys Glu 


Ile


Arg 


His 


Ser 




70








75 


Gly Glu Glu 


Phe 


Ser 


Phe 


Leu 




85 








90 


Ala 


Lys Glu 


Ser 


Gln


Tyr 


Val 




100 








105 


Val 


Gln Asn Gly


Phe 


His 


Val 




115 








120 


Lys 


Lys Tyr 


Phe 


Asn 


Ala 


Ala 




130 








135 


Asn 


Val Ala 


Val 


Ala 


Asn 


Tyr 




145 








150 


Thr 


Asn Arg 


Glu 


Phe 


Ser 


Asp 




160 








165 


Tyr 


Gln Val


Leu 


Glu 


Ile


Pro 


175 








180 
Glu


Gly Asp 


Glu 


Ile


Ser 










185 






Glu 


Val 


Pro 


Leu 


Ala 
200 


Thr 


Leu 


Val 


Glu 


Glu 


Trp


Ala 

215 


Asn 


Ser 


Tyr 


Leu 


Pro 


Arg 


Phe 
230 


Thr 


Val 


Val 


Leu 


Lys 


Ala 


Leu 
245 


Gly 


Ile


Asn 


Leu 


Thr 


Gly


Leu 
260 


Ser 


Asp 


Ala 


Ile


His


Lys 


Ser 
275 


Phe 


Leu 


Ala 


Ala 


Ala 


Val 


Ser 
290 


Gly 


Met 


Leu 


Tyr 


Pro 


Gln


Val 
305 


Ile


Val 


Arg 


Asn 


Arg 


Arg 


Thr 
320 


Gly 


Thr 


His 


Pro 


Glu 


Thr 


Met 
335 


Asn 


Thr 



Met 


Met 


Leu 


Val 


Leu 


Ser 


Arg 


Gln






190 










195 


Glu 


Pro 


Leu Val 


Lys 


Ala 


Gln


Leu 






205 










210 


Val 


Lys 


Lys 


Gln


Lys 


Val 


Glu 


Val 






220 










225 


Glu 


Gln


Glu 


Ile


Asp 


Leu 


Lys 


Asp 






235 










240 


Thr 


Glu 


Ile


Phe 


Ile


Lys 


Asp 


Ala 






250 










255 


Asn 


Lys 


Glu 


Ile


Phe 


Leu 


Ser 


Lys 




265 










270 


Glu 


Val 


Asn 


Glu 


Glu 


Gly 


Ser 


Glu 






280 










285 


Ile


Ala 


Ile


Ser 


Arg 


Met 


Ala 


Val 






295 










300 


Asp 


His 


Pro 


Phe 


Phe 


Phe 


Leu 


Ile




310 










315 


Ile


Leu 


Phe 


Met 


Gly Arg Val 


Met 






325 










330 


Ser Gly His 


Asp 


Phe 


Glu 


Glu 


Leu 






340 










345 



<210> 15 

<211> 493 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7506941CD1 

<400> 15 



Met 


Pro 


Asp 


Gln


Leu 


Glu 


Ser 


Leu 


Pro 


Leu 


Phe 


Ser 


Lys 


Lys 


Asn 


1 






5 










10 










15 


Leu 


Ile


Asp 


Ile


Leu 


Ala 


Ile


Gly Gly Val 


Gln


Lys 


Leu 


Lys 


Gln








20 










25 










30 


Leu 


Pro 


Val 


Val 


Val 


Lys 


Phe 


Leu 


Glu 


Phe 


Tyr 


Met 


Lys 


Lys 


Met 










35 








40 










45 


Met 


Asn 


Leu 


Arg 


Trp 


Lys 


Leu 


Phe 


Met 


Leu 


His 


Pro 


Leu 


Cys 


Trp 








50 










55 










60 


Lys 


Gln


Gly 


Arg 


Ala 


Asp 


Ser 


Phe 


Arg 


Tyr 


Pro 


Lys 


Thr 


Gly 


Thr 








65 










70 










75 


Ala 


Asn 


Pro 


Lys 


Val 


Thr 


Phe 


Lys 


Met 


Ser 


Glu 


Ile


Met 


Ile


Asp 








80 










85 










90 


Ala 


Glu 


Gly 


Arg 


Ile


His 


Asp 


Ile


Phe 


His 


Val 


Phe 


Pro 


Gln


Ser 






95 










100 










105 


His 


Glu 


Glu 


Glu 


Ile
110 


Glu 


Phe 


Ile


Phe 


Ala 
115 


Ser 


Glu 


Cys 


Lys 


Thr 
120 


Gly 


Phe 


Arg 


His 


Leu 


Tyr 


Lys 


Ile


Thr 


Ser 


Ile


Leu 


Lys 


Glu 


Ser 






125 










130










135 


Lys 


Tyr 


Lys 


Arg 


Ser 


Ser 


Gly 


Gly Leu 


Pro 


Ala 


Pro 


Ser 


Asp 


Phe 




140 










145 










150 


Lys 


Cys 


Pro 


Ile


Lys 


Glu 


Glu 


Ile


Ala 


Ile


Thr 


Ser 


Gly Glu 


Trp 






155 










160 










165 


Glu 


Val 


Leu 


Gly 


Arg 


His 


Gly 


Ser 


Asn 


Ile


Gln


Val 


Asp 


Glu 


Val 








170 










175 










180 


Arg Arg 


Leu 


Val 


Tyr 


Phe 


Glu 


Gly Thr 


Lys 


Asp 


Ser 


Pro 


Leu 


Glu 










185 










190 










195 


His 


His 


Leu 


Tyr 


Val 


Val 


Ser 


Tyr 


Val 


Asn 


Pro 


Gly 


Glu 


Val 


Thr 








200 










205 










210 


Arg 


Leu 


Thr 


Asp 


Arg Gly 


Tyr 


Ser 


His 


Ser 


Cys 


Cys 


Ile


Ser 


Gln
215










220 










225


His Cys 


Asp 


Phe 


Phe 


Ile


Ser 


Lys 


Tyr 


Ser 


Asn 


Gln


Lys 


Asn 


Pro 




230 










235 










240


His Cys 


Val 


Ser 


Leu 


Tyr 


Lys 


Leu 


Ser 


Ser 


Pro 


Glu 


Asp 


Asp 


Pro 






245 










250 










255


Thr Cys 


Lys 


Thr 


Lys 


Glu 


Phe 


Trp 


Ala 


Thr 


Ile


Leu 


Asp 


Ser 


Ala 




260 










265 










270 


Gly Pro 


Leu 


Pro 


Asp 


Tyr 


Thr 


Pro 


Pro 


Glu 


Ile


Phe 


Ser 


Phe 


Glu 






275 










280 










285 


Ser Thr 


Thr 


Gly 


Phe 


Thr 


Leu 


Tyr 


Gly Met 


Leu 


Tyr 


Lys 


Pro 


His 




290 










295 










300 


Asp Leu


Gln


Pro 


Gly 


Lys 


Lys 


Tyr 


Pro 


Thr 


Val 


Leu


Phe 


Ile


Tyr 






305 










310 










315 


Gly Gly 


Pro 


Gln


Val 


Gln


Leu 


Val 


Asn 


Asn 


Arg 


Phe 


Lys 


Gly Val 






320 










325 










330 


Lys Tyr 


Phe 


Arg 


Leu 


Asn 


Thr 


Leu 


Ala 


Ser 


Leu 


Gly Tyr Val Val 






335 










340 










345


Val Val


Ile


Asp 


Asn 


Arg 


Gly 


Ser 


Cys 


His 


Arg 


Gly Leu 


Lys 


Phe 




350 










355 










360 


Glu Gly Ala 


Phe 


Lys 


Tyr 


Lys 


Met 


Val 


Ala 


Ile Ala Gly Ala


Pro 








365 










370 










375 


Val Thr 


Leu 


Trp 


Ile


Phe 


Tyr 


Asp 


Thr 


Gly Tyr 


Thr 


Glu 


Arg 


Tyr 




380 










385 










390 


Met Gly His


Pro 


Asp 


Gln


Asn 


Glu 


Gln Gly Tyr


Tyr 


Leu 


Gly 


Ser 








395 










400 










405 


Val Ala 


Met 


Gln


Ala 
410 


Glu 


Lys 


Phe 


Pro 


Ser 
415 


Glu 


Pro 


Asn 


Arg 


Leu 

420 


Leu Leu 


Leu 


His 


Gly 
425 


Phe 


Leu 


Asp 


Glu 


Asn 
430 


Val 


His 


Phe 


Ala 


His 
435 


Thr Ser 


Ile


Leu 


Leu 


Ser 


Phe 


Leu 


Val 


Arg 


Ala 


Gly 


Lys 


Pro 


Tyr 






440 










445 










450 


Asp Leu 


Gln


Ile


Tyr 


Pro 


Gln


Glu 


Arg 


His 


Ser 


Ile


Arg 


Val 


Pro 






455 










460 










465 


Glu Ser 


Gly Glu 


His 


Tyr 


Glu 


Leu 


His 


Leu 


Leu 


His 


Tyr 


Leu 


Gln








470 










475 










480 


Glu Asn 


Leu 


Gly 


Ser 


Arg 


Ile


Ala 


Ala 


Leu 


Lys 


Val 


Ile










485 










490 













<210> 16 
<211> 204 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7507072CD1 



<400> 16 










Met 


Gln


Gly 


Thr 


Pro 


Gly 


Gly 


1 








5 






Val 


Asp 


Arg 


Arg 


Thr 
20 


Leu 


Leu 


Ala 


Leu 


Gly 


Gln


Met 
35 


Asn 


Phe 


Leu 


Ala 


Lys 


Asp 


Glu 
50 


Lys 


Gln


Gly 


Leu 


Lys 


Pro 


Gln


Lys 


Val 








65 






Pro 


Ser 


Leu 


Pro 


Val 
80 


Asp 


Met 


Asp 


Ile


Lys 


Ala 


Tyr 


Leu 


Glu 








95 






Met 


Ile


Lys 


Asp 


Ile


Gln


Val 



Gly 


Thr 


Arg 


Pro 


Gly Pro 


Ser 


Pro 




10 








15 


Val


Phe 


Ser 


Phe 


Ile Leu


Ala 


Ala 






25 








30 


Thr 


Gly Asp 


Gln


Val Leu 


Arg 


Val 






40 








45 


Leu 


Ser 


Leu 


Leu 


Gly Asp 


Leu 


Glu 






55 








60 


Asp 


Phe 


Trp 


Arg 


Gly Pro 


Ala 


Arg 




70 








75 


Arg 


Val 


Pro 


Phe 


Ser Glu 


Leu 


Lys 




85 








90 


Ser 


His 


Gly 


Leu 


Ala Tyr 


Ser 


Ile






100 








105 


Leu 


Leu 


Asp 


Glu 


Glu Arg Arg Ala 
110 115 120



Met 


Ala 


Lys 


Ser 


Arg 


Arg 


Leu 


Glu 


Arg 


Ser 


Thr 


Asn 


Ser 


Phe 


Ser 










125 










130 










13 5 


Tyr 


Ser 


Ser 


Tyr 


His 


Tnr 


Leu 


Glu 


Glu 


lie 


Tyr 


Ser 


Trp 


He 


Asp 










140 










145 










150 


Asn 


Phe 


Val 


Met 


Glu 


His 


Ser 


Asp 


lie 


Val 


Ser 


Lys 


He 


Gin 


He 










155 










160 










165 


Gly Asn 


Ser 


Phe 


Glu 


Asn 


Gin 


Ser 


He 


Leu 


Val 


Leu 


Lys 


Glu 


Met 










170 










175 










180 


Val 


Leu 


Thr 


Ala 


Thr 


Pro 


Ala 


Gin 


Lys 


Leu 


He 


Thr 


Gly Pro 


Pro 










185 










190 










195 


Leu 


Ser 


Arg 


Ser 


Arg 


Arg 


Trp 


Leu 


Pro 















200 

<210> 17 
<211> 224 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7507083CD1 

<400> 17 



Met Ala Asp Glu Lys Pro Ser Asn Gly Val Leu Val His Met Val
1 5 10 15

Lys Leu Leu Ile Lys Thr Phe Leu Asp Gly Ile Phe Asp Asp Leu
20 25 30

Met Glu Asn Asn Val Leu Asn Thr Asp Glu Ile His Leu Ile Gly
35 40 45

Lys Cys Leu Lys Phe Val Val Ser Asn Ala Glu Asn Leu Val Asp
50 55 60

Asp Ile Thr Glu Thr Ala Gln Ile Ala Gly Lys Ile Phe Arg Glu
65 70 75

His Leu Trp Asn Ser Lys Lys Gln Leu Ser Ser Asp Val Ser Ser
80 85 90

Asp Gly Glu Arg Glu Ala Asn Met Pro Gly Leu Asn Ile Arg Asn
95 100 105

Lys Glu Phe Asn Tyr Leu His Asn Arg Asn Gly Ser Glu Leu Asp
110 115 120

Leu Leu Gly Met Arg Asp Leu Leu Glu Asn Leu Gly Tyr Ser Val
125 130 135

Val Ile Lys Glu Asn Leu Thr Ala Gln Met Val Leu Gly Leu Phe
140 145 150

Gly Ser Pro Leu Thr Val Glu Lys Pro Val Gln Ile Leu Met Val
155 160 165

Gly Ser Cys Lys Val Thr Ser Val Met Met Leu Leu Gln Arg Leu
170 175 180

Met Trp Lys Arg Thr Ser Leu Leu Ser Asn Leu Pro His His Ile
185 190 195

Met Phe Leu Gly Asp Met Lys Gln Met Ala Pro Ser Ser Phe Pro
200 205 210

Lys Leu Ser Thr Thr Ser Glu Ser Ile Leu Gly Val Ile Ile
215 220

<210> 18 
<211> 277 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7509097CD1

<400> 18




















Met Pro Glu Lys Pro Gly Pro Arg Trp Ile Phe Ser Ser Thr Ser
1 5 10 15

Ala Lys Pro Gly Gly Leu Phe Pro Ser Glu Thr Met Glu Arg Asp
20 25 30

Ser His Gly Asn Ala Ser Pro Ala Arg Thr Pro Ser Ala Gly Ala
35 40 45

Ser Pro Ala Gln Ala Ser Pro Ala Gly Thr Pro Pro Gly Arg Ala
50 55 60

Ser Pro Ala Gln Ala Ser Pro Ala Gln Ala Ser Pro Ala Gly Thr
65 70 75

Pro Pro Gly Arg Ala Ser Pro Ala Gln Ala Ser Pro Ala Gly Thr
80 85 90

Pro Pro Gly Arg Ala Ser Pro Gly Arg Ala Ser Pro Ala Gln Ala
95 100 105

Ser Pro Ala Gln Ala Ser Pro Ala Arg Ala Ser Pro Ala Leu Ala
110 115 120

Ser Leu Ser Arg Ser Ser Ser Gly Arg Ser Ser Ser Ala Arg Ser
125 130 135

Ala Ser Val Thr Thr Ser Pro Thr Arg Val Tyr Leu Val Arg Ala
140 145 150

Thr Pro Val Gly Ala Val Pro Ile Arg Ser Ser Pro Ala Arg Ser
155 160 165

Ala Pro Ala Thr Arg Ala Thr Arg Glu Ser Pro Gly Thr Ser Leu
170 175 180

Pro Lys Phe Thr Trp Arg Glu Gly Gln Lys Gln Leu Pro Leu Ile
185 190 195

Gly Cys Val Leu Leu Leu Ile Ala Leu Val Val Ser Leu Ile Ile
200 205 210

Leu Phe Gln Phe Trp Gln Gly His Thr Gly Ile Arg Tyr Lys Glu
215 220 225

Gln Arg Glu Ser Cys Pro Lys His Ala Val Arg Cys Asp Gly Val
230 235 240

Val Asp Cys Lys Leu Lys Ser Asp Glu Leu Gly Cys Val Arg Phe
245 250 255

Asp Trp Asp Lys Ser Leu Leu Lys Ile Tyr Ser Gly Ser Ser His
260 265 270

Gln Trp Leu Pro Thr Ala Asp
275













<210> 19 

<211> 449 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7509118CD1 

<400> 19 



Met Pro Glu Lys Pro Gly Pro Arg Trp Ile Phe Ser Ser Thr Ser
1 5 10 15

Ala Lys Pro Gly Gly Leu Phe Pro Ser Glu Thr Met Glu Arg Asp
20 25 30

Ser His Gly Asn Ala Ser Pro Ala Arg Thr Pro Ser Ala Gly Ala
35 40 45

Ser Pro Ala Gln Ala Ser Pro Ala Gly Thr Pro Pro Gly Arg Ala
50 55 60

Ser Pro Ala Gln Ala Ser Pro Ala Gln Ala Ser Pro Ala Gly Thr
65 70 75

Pro Pro Gly Arg Ala Ser Pro Ala Gln Ala Ser Pro Ala Gly Thr
80 85 90

Pro Pro Gly Arg Ala Ser Pro Gly Arg Ala Ser Pro Ala Gln Ala
95 100 105

Ser Pro Ala Gln Ala Ser Pro Ala Arg Ala Ser Pro Ala Leu Ala
110 115 120

Ser Leu Ser Arg Ser Ser Ser Gly Arg Ser Ser Ser Ala Arg Ser
125 130 135

Ala Ser Val Thr Thr Ser Pro Thr Arg Val Tyr Leu Val Arg Ala
140 145 150

Thr Pro Val Gly Ala Val Pro Ile Arg Ser Ser Pro Ala Arg Ser
155 160 165

Ala Pro Ala Thr Arg Ala Thr Arg Glu Ser Pro Gly Thr Ser Leu
170 175 180

Pro Lys Phe Thr Trp Arg Glu Gly Gln Lys Gln Leu Pro Leu Ile
185 190 195

Gly Cys Val Leu Leu Leu Ile Ala Leu Val Val Ser Leu Ile Ile
200 205 210

Leu Phe Gln Phe Trp Gln Gly His Thr Gly Ile Arg Tyr Lys Glu
215 220 225

Gln Arg Glu Ser Cys Pro Lys His Ala Val Arg Cys Asp Gly Val
230 235 240

Val Asp Cys Lys Leu Lys Ser Asp Glu Leu Gly Cys Val Arg Phe
245 250 255

Asp Trp Asp Lys Ser Leu Leu Lys Ile Tyr Ser Gly Ser Ser His
260 265 270

Gln Trp Leu Pro Ile Cys Ser Ser Asn Trp Asn Asp Ser Tyr Ser
275 280 285

Glu Lys Thr Cys Gln Gln Leu Gly Phe Glu Ser Ala His Arg Thr
290 295 300

Thr Glu Val Ala His Arg Asp Phe Ala Asn Ser Phe Ser Ile Leu
305 310 315

Arg Tyr Asn Ser Thr Ile Gln Glu Ser Leu His Arg Ser Glu Cys
320 325 330

Pro Ser Gln Arg Tyr Ile Ser Leu Gln Cys Ser His Cys Gly Leu
335 340 345

Arg Ala Met Thr Gly Arg Ile Val Gly Gly Ala Leu Ala Ser Asp
350 355 360

Ser Lys Trp Pro Trp Gln Val Ser Leu His Phe Gly Thr Thr His
365 370 375

Ile Cys Gly Gly Thr Leu Ile Asp Ala Gln Trp Val Leu Thr Ala
380 385 390

Ala His Cys Phe Phe Val Thr Arg Glu Lys Val Leu Glu Gly Trp
395 400 405

Lys Val Tyr Ala Gly Thr Ser Asn Ser Tyr Pro Gly Pro Lys Ala
410 415 420

Ser Ala Gly Gln Lys Ser Lys Thr Leu Lys Asp Pro Tyr Met Glu
425 430 435

His Phe Cys Phe Ile Ile Arg Glu Thr Glu Ala Gln Gly Leu
440 445













<210> 20 
<211> 451 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7509312CD1 

<400> 20 

Met Glu Arg Asp Ser His Gly Asn Ala Ser Pro Ala Arg Thr Pro
1 5 10 15

Ser Ala Gly Ala Ser Pro Ala Gln Ala Ser Pro Ala Gly Thr Pro
20 25 30

Pro Gly Arg Ala Ser Pro Ala Gln Ala Ser Pro Ala Gln Ala Ser
35 40 45

Pro Ala Arg Ala Ser Pro Ala Leu Ala Ser Leu Ser Arg Ser Ser
50 55 60

Ser Gly Arg Ser Ser Ser Ala Arg Ser Ala Ser Val Thr Thr Ser
65 70 75

Pro Thr Arg Val Tyr Leu Val Arg Ala Thr Pro Val Gly Ala Val
80 85 90

Pro Ile Arg Ser Ser Pro Ala Arg Ser Ala Pro Ala Thr Arg Ala
95 100 105

Thr Arg Glu Ser Pro Gly Thr Ser Leu Pro Lys Phe Thr Trp Arg
110 115 120

Glu Gly Gln Lys Gln Leu Pro Leu Ile Gly Cys Val Leu Leu Leu
125 130 135

Ile Ala Leu Val Val Ser Leu Ile Ile Leu Phe Gln Phe Trp Gln
140 145 150

Gly His Thr Gly Ile Arg Tyr Lys Glu Gln Arg Glu Ser Cys Pro
155 160 165

Lys His Ala Val Arg Cys Asp Gly Val Val Asp Cys Lys Leu Lys
170 175 180

Ser Asp Glu Leu Gly Cys Val Arg Phe Asp Trp Asp Lys Ser Leu
185 190 195

Leu Lys Ile Tyr Ser Gly Ser Ser His Gln Trp Leu Pro Ile Cys
200 205 210

Ser Ser Asn Trp Asn Asp Ser Tyr Ser Glu Lys Thr Cys Gln Gln
215 220 225

Leu Gly Phe Glu Ser Ala His Arg Thr Thr Glu Val Ala His Arg
230 235 240

Asp Phe Ala Asn Ser Phe Ser Ile Leu Arg Tyr Asn Ser Thr Ile
245 250 255

Gln Glu Ser Leu His Arg Ser Glu Cys Pro Ser Gln Arg Tyr Ile
260 265 270

Ser Leu Gln Cys Ser His Cys Gly Leu Arg Ala Met Thr Gly Arg
275 280 285

Ile Val Gly Gly Ala Leu Ala Ser Asp Ser Lys Trp Pro Trp Gln
290 295 300

Val Ser Leu His Phe Gly Thr Thr His Ile Cys Gly Gly Thr Leu
305 310 315

Ile Asp Ala Gln Trp Val Leu Thr Ala Ala His Cys Phe Phe Val
320 325 330

Thr Arg Glu Lys Val Leu Glu Gly Trp Lys Val Tyr Ala Gly Thr
335 340 345

Ser Asn Leu His Gln Leu Pro Glu Ala Ala Ser Ile Ala Glu Ile
350 355 360

Ile Ile Asn Ser Asn Tyr Thr Asp Glu Glu Asp Asp Tyr Asp Ile
365 370 375

Ala Leu Met Arg Leu Ser Lys Pro Leu Thr Leu Ser Gly Glu Gly
380 385 390

Ile Cys Thr Pro Arg Ser Pro Ala Pro Gln Pro Gln His Pro Leu
395 400 405

Gln Pro Ser His Leu Ser Ala Ser Val Asn Ser Tyr Pro Gly Pro
410 415 420

Lys Ala Ser Ala Gly Gln Lys Ser Lys Thr Leu Lys Asp Pro Tyr
425 430 435

Met Glu His Phe Cys Phe Ile Ile Arg Glu Thr Glu Ala Gln Gly
440 445 450

Leu



<210> 21 

<211> 485 

<212> PRT 

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 90126902CD1 



<400> 21 

Met Gly Lys Leu Arg Pro Gly Arg Val Glu Trp Leu Ala Ser Gly
1 5 10 15

His Thr Glu Arg Pro His Leu Phe Gln Asn Leu Leu Leu Phe Leu
20 25 30

Trp Ala Leu Leu Asn Cys Gly Leu Gly Val Ser Ala Gln Gly Pro
35 40 45

Gly Glu Trp Thr Pro Trp Val Ser Trp Thr Arg Cys Ser Ser Ser
50 55 60

Cys Gly Arg Gly Val Ser Val Arg Ser Arg Arg Cys Leu Arg Leu
65 70 75

Pro Gly Glu Glu Pro Cys Trp Gly Asp Ser His Glu Tyr Arg Leu
80 85 90

Cys Gln Leu Pro Asp Cys Pro Pro Gly Ala Val Pro Phe Arg Asp
95 100 105

Leu Gln Cys Ala Leu Tyr Asn Gly Arg Pro Val Leu Gly Thr Gln
110 115 120

Lys Thr Tyr Gln Trp Val Pro Phe His Gly Ala Pro Asn Gln Cys
125 130 135

Asp Leu Asn Cys Leu Ala Glu Gly His Ala Phe Tyr His Ser Phe
140 145 150

Gly Arg Val Leu Asp Gly Thr Ala Cys Ser Pro Gly Ala Gln Gly
155 160 165

Val Cys Val Ala Gly Arg Cys Leu Ser Ala Gly Cys Asp Gly Leu
170 175 180

Leu Gly Ser Gly Ala Leu Glu Asp Arg Cys Gly Arg Cys Gly Gly
185 190 195

Ala Asn Asp Ser Cys Leu Phe Val Gln Arg Val Phe Arg Asp Ala
200 205 210

Gly Ala Phe Ala Gly Tyr Trp Asn Val Thr Leu Ile Pro Glu Gly
215 220 225

Ala Arg His Ile Arg Val Glu His Arg Ser Arg Asn His Leu Gly
230 235 240

Ile Leu Gly Ser Leu Met Gly Gly Asp Gly Arg Tyr Val Leu Asn
245 250 255

Gly His Trp Val Val Ser Pro Pro Gly Thr Tyr Glu Ala Ala Gly
260 265 270

Thr His Val Val Tyr Thr Arg Asp Thr Gly Pro Gln Glu Thr Leu
275 280 285

Gln Ala Ala Gly Pro Thr Ser His Asp Leu Leu Leu Gln Val Leu
290 295 300

Leu Gln Glu Pro Asn Pro Gly Ile Glu Phe Glu Phe Trp Leu Pro
305 310 315

Arg Glu Arg Tyr Ser Pro Phe Gln Ala Arg Val Gln Ala Leu Gly
320 325 330

Trp Pro Leu Arg Gln Pro Gln Pro Arg Gly Val Glu Pro Gln Pro
335 340 345

Pro Ala Ala Pro Ala Val Thr Pro Ala Gln Thr Pro Thr Leu Ala
350 355 360

Pro Asp Pro Cys Pro Pro Cys Pro Asp Thr Arg Gly Arg Ala His
365 370 375

Arg Leu Leu His Tyr Cys Gly Ser Asp Phe Val Phe Gln Ala Arg
380 385 390

Val Leu Gly His His His Gln Ala Gln Glu Thr Arg Tyr Glu Val
395 400 405

Arg Ile Gln Leu Val Tyr Lys Asn Arg Ser Pro Leu Arg Ala Arg
410 415 420

Glu Tyr Val Trp Ala Pro Gly His Cys Pro Cys Pro Met Leu Ala
425 430 435

Pro His Arg Asp Tyr Leu Met Ala Val Gln Arg Leu Val Ser Pro
440 445 450

Asp Gly Thr Gln Asp Gln Leu Leu Leu Pro His Ala Gly Tyr Ala
455 460 465

Arg Pro Trp Ser Pro Ala Glu Asp Ser Arg Ile Arg Leu Thr Ala
470 475 480

Arg Arg Cys Pro Gly
485

<210> 22 
<211> 258 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7509352CD1

<400> 22 

Met Ala Trp Ser Pro Pro Ala Thr Leu Phe Leu Phe Leu Leu Leu
1 5 10 15

Leu Gly Gln Pro Pro Pro Ser Arg Pro Gln Ser Leu Gly Thr Thr
20 25 30

Lys Leu Arg Leu Val Gly Pro Glu Ser Lys Pro Glu Glu Gly Arg
35 40 45

Leu Glu Val Leu His Gln Gly Gln Trp Gly Thr Val Cys Asp Asp
50 55 60

Asn Phe Ala Ile Gln Glu Ala Thr Val Ala Cys Arg Gln Leu Gly
65 70 75

Phe Glu Ala Ala Leu Thr Trp Ala His Ser Ala Lys Tyr Gly Gln
80 85 90

Gly Glu Gly Pro Ile Trp Leu Asp Asn Val Arg Cys Val Gly Thr
95 100 105

Glu Ser Ser Leu Asp Gln Cys His Arg His Tyr His Ser Ile Glu
110 115 120

Val Phe Thr His Tyr Asp Leu Leu Thr Leu Asn Gly Ser Lys Val
125 130 135

Ala Glu Gly His Lys Ala Ser Phe Cys Leu Glu Asp Thr Asn Cys
140 145 150

Pro Thr Gly Leu Gln Arg Arg Tyr Ala Cys Ala Asn Phe Gly Glu
155 160 165

Gln Gly Val Thr Val Gly Cys Trp Asp Thr Tyr Arg His Asp Ile
170 175 180

Asp Cys Gln Trp Val Asp Ile Thr Asp Val Gly Pro Gly Asn Tyr
185 190 195

Ile Phe Gln Val Ile Val Asn Pro His Tyr Glu Val Ala Glu Ser
200 205 210

Asp Phe Ser Asn Asn Met Leu Gln Cys Arg Cys Lys Tyr Asp Gly
215 220 225

His Arg Val Trp Leu His Asn Cys His Thr Gly Asn Ser Tyr Pro
230 235 240

Ala Asn Ala Glu Leu Ser Leu Glu Gln Glu Gln Arg Leu Arg Asn
245 250 255

Asn Leu Ile



<210> 23
<223> Incyte ID No: 7509341CD1



<400> 23 

Met Gly Leu Arg Ala Gly Pro Ile Leu Leu Leu Leu Leu Trp Leu
1 5 10 15

Leu Pro Gly Ala His Trp Asp Val Leu Pro Ser Glu Cys Gly His
20 25 30

Ser Lys Glu Ala Gly Arg Ile Val Gly Gly Gln Asp Thr Gln Glu
35 40 45

Gly Arg Trp Pro Trp Gln Val Gly Leu Trp Leu Thr Ser Val Gly
50 55 60

His Val Cys Gly Gly Ser Leu Ile His Pro Arg Trp Val Leu Thr
65 70 75

Ala Ala His Cys Phe Leu Arg Ser Glu Asp Pro Gly Leu Tyr His
80 85 90

Val Lys Val Gly Gly Leu Thr Pro Ser Leu Ser Glu Pro His Ser
95 100 105

Ala Leu Val Ala Val Arg Arg Leu Leu Val His Ser Ser Tyr His
110 115 120

Gly Thr Thr Thr Ser Gly Asp Ile Ala Leu Met Glu Leu Asp Ser
125 130 135

Pro Leu Gln Ala Ser Gln Phe Ser Pro Ile Cys Leu Pro Gly Pro
140 145 150

Gln Thr Pro Leu Ala Ile Gly Thr Val Cys Trp Gly Asp Ser Gly
155 160 165

Gly Pro Leu Val Cys Pro Ile Asn Asp Thr Trp Ile Gln Ala Gly
170 175 180

Ile Val Ser Trp Gly Phe Gly Cys Ala Arg Pro Phe Arg Pro Gly
185 190 195

Val Tyr Thr Gln Val Leu Ser Tyr Thr Asp Trp Ile Gln Arg Thr
200 205 210

Leu Ala Glu Ser His Ser Gly Met Ser Gly Ala Arg Pro Gly Ala
215 220 225

Pro Gly Ser His Ser Gly Thr Ser Arg Ser His Pro Val Leu Leu
230 235 240

Leu Glu Leu Leu Thr Val Cys Leu Leu Gly Ser Leu
245 250



<210> 24 

<211> 102 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7509367CD1



<400> 24 



Met Gly Leu Arg Ala Gly Pro Ile Leu Leu Leu Leu Leu Trp Leu
1 5 10 15

Leu Pro Gly Ala His Trp Asp Val Leu Pro Ser Glu Cys Gly His
20 25 30

Ser Lys Glu Ala Gly Arg Ile Val Gly Gly Gln Asp Thr Gln Glu
35 40 45

Gly Arg Trp Pro Trp Gln Val Gly Leu Trp Leu Thr Ser Val Gly
50 55 60

His Val Cys Gly Gly Ser Leu Ile His Pro Arg Trp Val Leu Thr
65 70 75

Ala Ala His Cys Phe Leu Arg Val Thr Pro Gly Gly Arg Trp Ser
80 85 90

Ala Pro Ser Met Ile Arg Gly Ser Arg Pro Ala Leu
95 100

<210> 25

<211> 62 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7500455CD1 

<400> 25 

Met Pro Ala Val Ala Ser Val Pro Lys Glu Leu Tyr Leu Ser Ser
1 5 10 15

Ser Leu Lys Asp Leu Asn Lys Lys Thr Glu Val Lys Pro Glu Lys
20 25 30

Ile Ser Thr Lys Arg Ile Ile Ser Ile Gln Tyr Leu Asp Leu Glu
35 40 45

Thr Ser Lys Lys Leu Ser Lys Lys Leu Lys Asp Ser Leu Lys Ala
50 55 60

Leu Asn



<210> 26 

<211> 181 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7510401CD1 

<400> 26 
Met Ala Glu L 
1 



Thr 


Ala 


Leu Glu 


Ser 


Leu 


Ile


Glu 


Met 


Gly 


Phe 


5 








10 










15 


Ala 


Glu 


Lys Ala 


Leu 


Ala 


Leu 


Thr 


Gly 


Asn 


Gln


20 






25 










30 


Ala 


Met 


Asp Trp 


Leu 


Met 


Glu 


His 


Glu 


Asp 


Asp 


35 






40 










45 


Glu 


Pro 


Leu Glu 


Thr 


Pro 


Leu 


Gly 


His 


Ile


Leu 


50 








55 










60 


Thr 


Ser 


Ser Glu 


Gln


Gly Gly 


Leu 


Glu 


Gly 


Ser 


65 








70 










75 


Gly Glu 


Gly Lys 


Pro 


Ala 


Leu 


Ser 


Glu 


Glu 


Glu 


80 








85 










90 


Thr 


Lys 


Arg Met 


Leu 


Glu 


Leu 


Val 


Ala 


Gln


Lys 


95 




100 










105 


Glu 


Glu 


Arg Glu 


Glu 


Arg 


Glu 


Ala 


Leu 


Glu 


Arg 


110 






115 










120 


Arg 


Arg 


Gln Gly Gln


Glu 


Leu 


Ser 


Ala 


Ala 


Arg 


125 






130 










135 


Glu 


Asp 


Glu Met 


Arg 


Arg 


Ala 


Ala 


Glu 


Glu 


Arg 


140 






145 










150 


Ala 


Glu 


Glu Leu 


Ala 


Ala 


Arg 


Gln


Arg 


Val 


Arg 


155 








160 










165 


Arg 


Asp 


Lys Ala 


Glu 


Arg 


Ala 


Lys 


Lys 


Val 


Gly 


170 






175 










180 



Asp

<220>

<221> misc_feature

<223> Incyte ID No: 7504702CD1 

<400> 27 



Met Ser Pro Pro Pro Leu Leu Gln Pro Leu Leu Leu Leu Leu Pro
1 5 10 15

Leu Leu Asn Val Glu Pro Ser Gly Ala Thr Leu Ile Arg Ile Pro
20 25 30

Leu His Arg Val Gln Pro Gly Arg Arg Ile Leu Asn Leu Leu Arg
35 40 45

Gly Trp Arg Glu Pro Ala Glu Leu Pro Lys Leu Gly Ala Pro Ser
50 55 60

Pro Gly Asp Lys Pro Ile Phe Val Pro Leu Ser Asn Tyr Arg Asp
65 70 75

Gly Tyr Thr Thr Asp Leu Ile Pro Lys Pro Leu Ala Pro Ser Arg
80 85 90

Pro Met Gly Pro Ser Leu Pro Phe Asn Met Glu Leu Gly Gly
95 100











<210> 28 
<211> 444 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7509113CD1 

<400> 28 



Met Gly Arg Pro Leu His Leu Val Leu Leu Ser Ala Ser Leu Ala
1 5 10 15

Gly Leu Leu Leu Leu Gly Glu Ser Leu Phe Ile Arg Arg Glu Gln
20 25 30

Ala Asn Asn Ile Leu Ala Arg Val Thr Arg Ala Asn Ser Phe Leu
35 40 45

Glu Glu Met Lys Lys Gly His Leu Glu Arg Glu Cys Met Glu Glu
50 55 60

Thr Cys Ser Tyr Glu Glu Ala Arg Glu Val Phe Glu Asp Ser Asp
65 70 75

Lys Thr Asn Glu Phe Trp Asn Lys Tyr Lys Asp Gly Asp Gln Cys
80 85 90

Glu Thr Ser Pro Cys Gln Asn Gln Gly Lys Cys Lys Asp Gly Leu
95 100 105

Gly Glu Tyr Thr Cys Thr Cys Leu Glu Gly Phe Glu Gly Lys Asn
110 115 120

Cys Glu Leu Trp Pro Tyr Pro Cys Gly Lys Gln Thr Leu Glu Arg
125 130 135

Arg Lys Arg Ser Val Ala Gln Ala Thr Ser Ser Ser Gly Glu Ala
140 145 150

Pro Asp Ser Ile Thr Trp Lys Pro Tyr Asp Ala Ala Asp Leu Asp
155 160 165

Pro Thr Glu Asn Pro Phe Asp Leu Leu Asp Phe Asn Gln Thr Gln
170 175 180

Pro Glu Arg Gly Asp Asn Asn Leu Thr Arg Ile Val Gly Gly Gln
185 190 195

Glu Cys Lys Asp Gly Glu Cys Pro Trp Gln Ala Leu Leu Ile Asn
200 205 210

Glu Glu Asn Glu Gly Phe Cys Gly Gly Thr Ile Leu Ser Glu Phe
215 220 225

Tyr Ile Leu Thr Ala Ala His Cys Leu Tyr Gln Ala Lys Arg Phe
230 235 240

Lys Val Arg Val Gly Asp Arg Asn Thr Glu Gln Glu Glu Gly Gly
245 250 255

Glu Ala Val His Glu Val Glu Val Val Ile Lys His Asn Arg Phe
260 265 270

Thr Lys Glu Thr Tyr Asp Phe Asp Ile Ala Val Leu Arg Leu Lys
275 280 285

Thr Pro Ile Thr Phe Arg Met Asn Val Ala Pro Ala Cys Leu Pro
290 295 300

Glu Arg Asp Trp Ala Glu Ser Thr Leu Met Thr Gln Lys Thr Gly
305 310 315

Ile Val Ser Gly Phe Gly Arg Thr His Glu Lys Gly Arg Gln Ser
320 325 330

Thr Arg Leu Lys Met Leu Glu Val Pro Tyr Val Asp Arg Asn Ser
335 340 345

Cys Lys Leu Ser Ser Ser Phe Ile Ile Thr Gln Asn Met Phe Cys
350 355 360

Ala Gly Tyr Asp Thr Lys Gln Glu Asp Ala Cys Gln Gly Asp Ser
365 370 375

Gly Gly Pro His Val Thr Arg Phe Lys Asp Thr Tyr Phe Val Thr
380 385 390

Gly Ile Val Ser Trp Gly Glu Gly Cys Ala Arg Lys Gly Lys Tyr
395 400 405

Gly Ile Tyr Thr Lys Val Thr Ala Phe Leu Lys Trp Ile Asp Arg
410 415 420

Ser Met Lys Thr Arg Gly Leu Pro Lys Ala Lys Ser His Ala Pro
425 430 435

Glu Val Ile Thr Ser Ser Pro Leu Lys
440













<210> 29 

<211> 377 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7509140CD1 



<400> 29 






















Met Gly Arg Pro Leu His Leu Val Leu Leu Ser Ala Ser Leu Ala
1 5 10 15

Gly Leu Leu Leu Leu Gly Glu Ser Leu Phe Ile Arg Arg Glu Gln
20 25 30

Ala Asn Asn Ile Leu Ala Arg Val Thr Arg Ala Asn Ser Phe Leu
35 40 45

Glu Glu Met Lys Lys Gly His Leu Glu Arg Glu Cys Met Glu Glu
50 55 60

Thr Cys Ser Tyr Glu Glu Ala Arg Glu Val Phe Glu Asp Ser Asp
65 70 75

Lys Thr Asn Glu Phe Trp Asn Lys Tyr Lys Asp Gly Asp Gln Cys
80 85 90

Glu Thr Ser Pro Cys Gln Asn Gln Gly Lys Cys Lys Asp Gly Leu
95 100 105

Gly Glu Tyr Thr Cys Thr Cys Leu Glu Gly Phe Glu Gly Lys Asn
110 115 120

Cys Glu Leu Phe Thr Arg Lys Leu Cys Ser Leu Asp Asn Gly Asp
125 130 135

Cys Asp Gln Phe Cys His Glu Glu Gln Asn Ser Val Val Cys Ser
140 145 150

Cys Ala Arg Gly Tyr Thr Leu Ala Asp Asn Gly Lys Ala Cys Ile
155 160 165

Pro Thr Gly Pro Tyr Pro Cys Gly Lys Gln Thr Leu Glu Arg Arg
170 175 180

Lys Arg Ser Val Ala Gln Ala Thr Ser Ser Ser Gly Glu Ala Pro
185 190 195

Asp Ser Ile Thr Trp Lys Pro Tyr Asp Ala Ala Asp Leu Asp Pro
200 205 210

Thr Glu Asn Pro Phe Asp Leu Leu Asp Phe Asn Gln Thr Gln Pro
215 220 225

Glu Arg Gly Asp Asn Asn Leu Thr Arg Ile Val Gly Gly Gln Glu
230 235 240

Cys Lys Asp Gly Glu Cys Pro Trp Gln Ala Leu Leu Ile Asn Glu
245 250 255

Glu Asn Glu Lys Gly Arg Gln Ser Thr Arg Leu Lys Met Leu Glu
260 265 270

Val Pro Tyr Val Asp Arg Asn Ser Cys Lys Leu Ser Ser Ser Phe
275 280 285

Ile Ile Thr Gln Asn Met Phe Cys Ala Gly Tyr Asp Thr Lys Gln
290 295 300

Glu Asp Ala Cys Gln Gly Asp Ser Gly Gly Pro His Val Thr Arg
305 310 315

Phe Lys Asp Thr Tyr Phe Val Thr Gly Ile Val Ser Trp Gly Glu
320 325 330

Gly Cys Ala Arg Lys Gly Lys Tyr Gly Ile Tyr Thr Lys Val Thr
335 340 345

Ala Phe Leu Lys Trp Ile Asp Arg Ser Met Lys Thr Arg Gly Leu
350 355 360

Pro Lys Ala Lys Ser His Ala Pro Glu Val Ile Thr Ser Ser Pro
365 370 375

Leu Lys



<210> 30
<212> PRT
<220>
<223> Incyte ID No: 7509223CD1

<400> 30 



Met 


Gly 


Arg 


Pro 


Leu 


His 


Leu 


Val 


Leu 


Leu 


Ser 


Ala 


Ser 


Leu 


Ala 


1 








5 










10 










15 


Gly 


Leu 


Leu 


Leu 


Leu 


Gly 


Glu 


Ser 


Leu 


Phe 


He 


Arg 


Arg 


Glu 


Gin 










20 










25 










30 


Ala 


Asn 


Asn 


He 


Leu 


Ala 


Arg 


Val 


Thr 


Arg 


Ala 


Asn 


Ser 


Phe 


Leu 










35 










40 










45 


Glu 


Glu 


Met 


Lys 


Lys 


Gly 


His 


Leu 


Glu 


Arg 


Glu 


Cys 


Met 


Glu 


Glu 










50 










55 










60 


Thr 


Cys 


Ser 


Tyr 


Glu 


Glu 


Ala 


Arg 


Glu 


Val 


Phe 


Glu 


Asp 


Ser 


Asp 










65 










70 










75 


Lys 


Thr 


Asn 


Glu 


Phe 


Trp 


Asn 


Lys 


Tyr 


Lys 


Asp 


Gly 


Asp 


Gin 


Cys 










80 










85 










90 


Glu 


Thr 


Ser 


Pro 


Cys 


Gin 


Asn 


Gin 


Gly 


Lys 


Cys 


Lys 


Asp 


Gly 


Leu 










95 










100 










105 


Gly 


Glu 


Tyr 


Thr 


Cys 


Thr 


Cys 


Leu 


Glu 


Gly 


Phe 


Glu 


Gly Lys 


Asn 










110 










115 










120 


Cys 


Glu 


Leu 


Phe 


Thr 


Arg 


Lys 


Leu 


Cys 


Ser 


Leu 


Asp 


Asn Gly 


Asp 










125 










130 










135 


Cys 


Asp 


Gin 


Phe 


Cys 


His 


Glu 


Glu 


Gin 


Asn 


Ser 


Val 


Val 


Cys 


Ser 










140 










145 










150 


Cys 


Ala 


Arg 


Gly 


Tyr 


Thr 


Leu 


Ala 


Asp 


Asn 


Gly Lys 


Ala 


Cys 


He 










155 










160 










165 


Pro 


Thr 


Gly 


Pro 


Tyr 


Pro 


Cys 


Gly 


Lys 


Gin 


Thr 


Leu 


Glu 


Arg 


Arg 










170 










175 










180 


Lys 


Arg 


Ser 


Val 


Ala 


Gin 


Ala 


Thr 


Ser 


Ser 


Ser 


Gly Glu Ala 


Pro 
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185 










190 










195 


Asp Ser 


lie 


Thr Trp 


Lys 


Pro 


Tyr 


Asp 


Ala 


Ala 


Asp 


Leu 


Asp 


Pro 




200 










205 










210 


Thr Glu 


Asn 


Pro Phe 
215 


Asp 


Leu 


Leu 


Asp 


Phe 
220 


Asn 


Gin 


Thr 


Gin 


Pro 
225 


Glu Arg Gly Asp Asn 


Asn 


Leu 


Thr 


Arg 


Ile Val Gly Gly Gln Glu






230 










235 










240 


Cys Lys Asp Gly Glu Cys 


Pro 


Trp 


Gln


Ala 


Leu 


Leu 


Ile


Asn 


Glu 






245 










250 










255 


Glu Asn 


Glu 


Val Glu 
260 


Val 


Val 


Ile


Lys 


His 
265 


Asn 


Arg 


Phe 


Thr 


Lys 
270 




Tyr 




Asp 


Ile


Ala 


Val 


Leu 


Arg 


Leu 


Lys 


Thr 


Pro 




275 










280 










285 


Ile Thr


Phe 




Asn 


Val 


Ala 


Pro 


Ala 
295 


Cys 


Leu 


Pro 


Glu 


Arg 
300 


Asp Trp 


Ala 


Glu Ser 


Thr 


Leu 


Met 


Thr 


Gln


Lys 


Thr 


Gly 


Ile


Val 




305 










310 










315 


Ser Gly 


Phe 


Gly Arg 


Thr 


His 


Glu 


Lys 


Gly Arg Gln Ser


Thr 


Arg 




320 










325 










330 


Leu Lys 


Met 


Leu Glu 


Val 


Pro 


Tyr 


Val 


Asp 


Arg 


Asn 


Ser 


Cys 


Lys 




335 










340 










345 


Leu Ser 


Ser 


Ser Phe 


Ile


Ile


Thr 


Gln


Asn 


Met 


Phe 


Cys 


Ala 


Gly 




350 










355 










360 


Tyr Asp 


Thr 


Lys Gln


Glu 


Asp 


Ala 


Cys 


Gln


Gly Asp 


Ser 


Gly Gly 




365 










370 










375 


Pro His 


Val 


Thr Arg 
380 


Phe 


Lys 


Asp 


Thr 


Tyr 

385 


Phe 


Val 


Thr 


Gly 


Ile
390 


Val Ser 


Trp 


Gly Glu 


Gly 


Cys 


Ala 


Arg 


Lys 


Gly 


Lys 


Tyr 


Gly 


Ile




395 










400 










405 


Tyr Thr 


Lys 


Val Thr 


Ala 


Phe 


Leu 


Lys 


Trp 


Ile


Asp 


Arg 


Ser 


Met 


410 










415 










420 


Lys Thr 


Arg Gly Leu 


Pro 


Lys 


Ala 


Lys 


Ser 


His 


Ala 


Pro 


Glu 


Val 




425 










430 










435 


Ile Thr


Ser 


Ser Pro 
440 


Leu 


Lys 



















<210> 31 
<211> 375 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7509272CD1 

<400> 31 



Met 


Glu 


Arg 


Asp 


Ser 


His 


Gly 


Asn 


Ala 


Ser 


Pro 


Ala 


Arg 


Thr 


Pro 


1 




5 










10 










15 


Ser 


Ala 


Gly 


Ala 


Ser 


Pro 


Ala 


Gln


Ala 


Ser 


Pro 


Ala 


Gly 


Thr 


Pro 








20 










25 










30 


Pro 


Gly 


Arg 


Ala 


Ser 


Pro 


Ala 


Gln


Ala 


Ser 


Pro 


Ala 


Gln


Ala 


Ser 






35 










40 










45 


Pro 


Ala 


Gly 


Thr 


Pro 


Pro 


Gly 


Arg 


Ala 


Ser 


Pro 


Ala 


Gln


Ala 


Ser 








50 










55 










60 


Pro 


Ala 


Gly 


Thr 


Pro 


Pro 


Gly 


Arg 


Ala 


Ser 


Pro 


Gly Arg Ala 


Ser 








65 










70 










75 


Pro 


Ala 


Gln


Ala 


Ser 
80 


Pro 


Ala 


Gln


Ala 


Ser 
85 


Pro 


Ala 


Arg 


Ala 


Ser 
90 


Pro 


Ala 


Leu 


Ala 


Ser 
95 


Leu 


Ser 


Arg 


Ser 


Ala 
100 


Pro 


Ala 


Thr 


Arg 


Ala 
105 


Thr 


Arg 


Glu 


Ser 


Pro 


Gly 


Thr 


Ser 


Leu 


Pro 


Lys 


Phe 


Thr 


Trp 


Arg 








110 










115 










120 


Glu 


Gly 


Gln


Lys 


Gln


Leu 


Pro 


Leu 


Ile


Gly Cys


Val 


Leu 


Leu 


Leu 
125








130










135 


Ile


Ala 


Leu Val Val 


Ser Leu 


Ile


Ile


Leu 


Phe 


ljrJ.n 




irp 


m n 

J.XJI 




140 


















150 


Gly His


Thr Gly Ile


Arg Tyr 


Lys 


Glu 


Gln


Arg 




Ser 


Cys 


XT J. \J 






155 


















165 


Lys 


His Ala Val Arg Cys Asp Gly Val 


Val


Asp


Cys 


Lys 


Leu 






170 








175










180 


Ser 


Asp 


Glu Leu Gly Cys Val 


Arg 


Phe 


Asp 


Trp 


Asp 


Lys 


Ser 


Leu 




185 








190










195


Leu 


Lys 


Ile Tyr Ser Gly Ser


Ser 


His 


Gln


Trp 


Leu 


Pro 


X j.e 


Cys 




200 








O A c 

205 










210


Ser 


Ser Asn Trp Asn Asp Ser 


Tyr 


Ser 


Glu 


Lys 


Thr 


Cys 


Qsxn 


VaXn 






215 








220 










225 


Leu 


Gly 


Phe Glu Ser 


Ala His 


Arg 


Thr 


Thr 


Glu 


Val 


Ala 


His 


Arg 




230 








235 










240 


Asp 


Phe 


Ala Asn Ser 


Phe Ser 


Ile


Leu 


Arg 


Tyr 


Asn 


Ser 


Thr 


Ile




245 








250 










255 


Gln


Glu 


Ser Leu His 


Arg Ser 


Glu 


Cys 


Pro 


Ser 


Gln Arg


Tyr 


Ile






260 








265 










270 


Ser 


Leu 


Gln Cys Ser


His Cys 


Gly Leu 


Arg 


Ala 


Met 


Thr 


Gly Arg 






275 








280 










285 


Ile


Val 


Gly Gly Ala 
290 


Leu Ala 


Ser 


Asp 


Ser 
295 


Lys 


Trp 


Pro 


Trp 


Gln
300 


Val 


Ser 


Leu His Phe 


Gly Thr 


Thr 


His 


Ile Cys Gly Gly Thr


Leu 






305 








310 










315 


Ile


Asp 


Ala Gln Trp


Val Leu 


Thr 


Ala 


Ala 


His 


Cys 


Phe 


Phe 


Val 




320 








325 










330 


Thr 


Arg 


Glu Lys Val 


Leu Glu 


Gly 


Trp 


Lys 


Val 


Tyr 


Ala 


Gly Thr 




335 








340 










345 


Ser 


Asn 


Leu His Gln
350 


Leu Pro 


Glu 


Ala 


Ala 
355 


Ser 


Ile


Ala 


Glu 


Ile

360 


Ile


Ile


Asn Ser Asn 


Tyr Thr 


Asp 


Glu 


Glu 


Glu 


Val 


Lys 


Asp 


Ser 






365 






370 










375 



<210> 32 

<211> 204 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7509327CD1 

<400> 32 



Met 


Glu 


Arg 


Asp 


Ser 


His 


Gly 


Asn 


1 








5 








Ser 


Ala 


Gly 


Ala 


Ser 
20 


Pro 


Ala 


Gln


Pro 


Gly 


Arg 


Ala 


Ser 

35 


Pro 


Ala 


Gln


Pro 


Ala 


Gly 


Thr 


Pro 
50 


Pro 


Gly 


Arg 


Pro 


Ala 


Gly 


Thr 


Pro 
65 


Pro 


Gly 


Arg 


Pro 


Ala 


Gln


Ala 


Ser 
80 


Pro 


Ala 


Gln


Pro 


Ala 


Leu 


Ala 


Ser 
95 


Leu 


Ser 


Arg 


Ser 


Ala 


Arg 


Ser 


Ala 
110 


Ser 


Val 


Thr 


Leu 


Val 


Arg 


Ala 


Thr 
125 


Pro 


Val 


Gly 



Ala 


Ser 


Pro 


Ala Arg 


Thr 


Pro 




10 








15 


Ala 


Ser Pro Ala Gly Thr 


Pro 




25 








30 


Ala 


Ser 


Pro 


Ala Gln


Ala 


Ser 




40 








45 


Ala 


Ser 


Pro 


Ala Gln


Ala 


Ser 




55 








60 


Ala 


Ser 


Pro 


Gly Arg 


Ala 


Ser 




70 








75 


Ala 


Ser 


Pro 


Ala Arg 


Ala 


Ser 




85 








90 


Ser 


Ser 


Ser 


Gly Arg 


Ser 


Ser 




100 








105 


Thr 


Ser 


Pro 


Thr Arg 


Val 


Tyr 




115 








120 


Ala 


Val 


Pro 


Ile Arg


Ser 


Ser 




130 








135 






WO 03/063688



PCT/US03/02500 



Pro 


Ala 


Arg 


Ser 


Ala 


Pro 


Ala 


Thr 


Arg 


Ala 


Thr Arg 


Glu Ser 


Pro 








140 










145 






150 


Gly Thr 


Ser 


Leu 


Pro 


Lys 


Phe Thr Trp Arg Glu Gly Gln Lys


Gln










155 










160 






165 


Leu 


Pro 


Leu 


Ile


Gly 


Cys 


Val 


Leu 


Leu 


Leu 


Ile Ala


Leu Val 


Val 










170 










175 






180 


Ser 


Leu 


Ile


Ile


Leu 


Leu 


Leu 


Thr 


Gly 


Gln


Pro Arg 


Leu Pro 


Thr 










185 










190 






195 


Gly 


Ile


Leu 


Pro 


Thr 


Ala 


Ser 


Gln


Ser 
















200 



















<210> 33 
<211> 86 
<212> PRT 
<213> Homo sapiens



<220> 

<221> misc_feature 

<223> Incyte ID No: 7504677CD1 



<400> 33 










Met Pro Lys 


Thr 


Met 


His 


Phe 


1 




5 






Tyr Leu Trp 


Gly 


Leu 


Phe 


Thr 






20 






Ser Thr Glu 


Glu 


Val 


Lys 


Ile






35 






Cys Ser Lys 


Thr 


Ser 


Lys 


Lys 






50 






Asp Gly Tyr 


Leu 


Ala 


Lys 


Asp 






65 






Arg Gln Asp


Ser 


Thr 


Gly Cys 






80 







Leu 


Phe 


Arg 
10 


Phe 


Ile


Val 


Phe 


Phe 
15 


Ala 


Gln


Arg 
25 


Gln


Lys 


Lys 


Glu 


Glu 
30 


Glu 


Val 


Leu 
40 


His 


Arg 


Pro 


Glu 


Asn 
45 


Gly Asp 


Leu 


Leu 


Asn 


Ala 


His 


Tyr 






55 










60 


Gly 


Ser 


Lys 
70 


Phe 


Tyr 


Cys 


Ser 


Arg 
75 


Tyr 


Ile


Asp 
85 


Phe 











<210> 34 

<211> 882 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7504534CD1 



<400> 34 










Met 


Ala 


Glu 


Gly 


Gly 


Gly 


Cys 


1 








5 






Gln


Lys 


Ser 


Glu 


Leu 
20 


Gly 


Pro 


Gly 


Ala 


Gln


Trp 


Tyr 
35 


Leu 


Ile


Lys 


Lys 


Tyr 


Val 


Gly 
50 


Phe 


Asp 


Glu 


His 


Asn 


Leu 


Phe 
65 


Pro 


Gly 


Ser 


Asp 


Pro 


Glu 


Ser 
80 


Gln


Thr 


Leu 


Asp 


Tyr 


Val 


Leu 
95 


Val 


Pro 


Asn 


Trp 


Tyr 


Gly 


Cys 
110 


Val 


Glu 


Val 


Val 


Glu 


His 


Gly 
125 


Leu 


Phe 


Tyr 


Leu 


Leu 


Glu 


Leu 


Lys 


Leu 



Arg 


Glu 


Arg 


Pro 


Asp 


Ala 


Glu 


Thr 






10 










15 


Leu 


Met 


Arg 


Thr 


Thr 


Leu 


Gln


Arg 






25 










30 


Asp 


Ser 


Arg 


Trp 


Phe 


Lys 


Gln


Trp 




40 










45 


Ser 


Trp 


Gly Met 


Tyr 


Asn 


Val 


Gly 




55 










60 


Pro 


Ile


Asp 


Asn 


Ser 


Gly Leu 


Phe 






70 










75 


Leu 


Lys 


Glu 


His 


Leu 


Ile


Asp 


Glu 




85 










90 


Thr 


Glu 


Ala 


Trp 


Asn 


Lys 


Leu 


Leu 






100 










105 


Gly Gln


Gln


Pro 


Ile


Val 


Arg 


Lys 






115 










120 


Val 


Lys 


His 


Cys 


Lys 


Val 


Glu Val 




130 










135 


Cys 


Glu 


Asn 


Ser 


Asp 


Pro 


Thr 


Asn 
140 145 150

Val Leu Ser Cys His Phe Ser Lys Ala Asp Thr Ile Ala Thr Ile
155 160 165

Glu Lys Glu Met Arg Lys Leu Phe Asn Ile Pro Ala Glu Arg Glu
170 175 180

Thr Arg Leu Trp Asn Lys Tyr Met Ser Asn Thr Tyr Glu Gln Leu
185 190 195

Ser Lys Leu Asp Asn Thr Val Gln Asp Ala Gly Leu Tyr Gln Gly
200 205 210

Gln Val Leu Val Ile Glu Pro Gln Asn Glu Asp Gly Thr Trp Pro
215 220 225

Arg Gln Thr Leu Gln Ser Lys Ser Ser Thr Ala Pro Ser Arg Asn
230 235 240

Phe Thr Thr Ser Pro Lys Ser Ser Ala Ser Pro Tyr Ser Ser Val
245 250 255

Ser Ala Ser Leu Ile Ala Asn Gly Asp Ser Thr Ser Thr Cys Gly
260 265 270

Met His Ser Ser Gly Val Ser Arg Gly Gly Ser Gly Phe Ser Ala
275 280 285

Ser Tyr Asn Cys Gln Glu Pro Pro Ser Ser His Ile Gln Pro Gly
290 295 300

Leu Cys Gly Leu Gly Asn Leu Gly Asn Thr Cys Phe Met Asn Ser
305 310 315

Ala Leu Gln Cys Leu Ser Asn Thr Ala Pro Leu Thr Asp Tyr Phe
320 325 330

Leu Lys Asp Glu Tyr Glu Ala Glu Ile Asn Arg Asp Asn Pro Leu
335 340 345

Gly Met Lys Gly Glu Ile Ala Glu Ala Tyr Ala Glu Leu Ile Lys
350 355 360

Gln Met Trp Ser Gly Arg Asp Ala His Val Ala Pro Arg Met Phe
365 370 375

Lys Thr Gln Val Gly Arg Phe Ala Pro Gln Phe Ser Gly Tyr Gln
380 385 390

Gln Gln Asp Ser Gln Glu Leu Leu Ala Phe Leu Leu Asp Gly Leu
395 400 405

His Glu Asp Leu Asn Arg Val Lys Lys Lys Pro Tyr Leu Glu Leu
410 415 420

Lys Asp Ala Asn Gly Arg Pro Asp Ala Val Val Ala Lys Glu Ala
425 430 435

Trp Glu Asn His Arg Leu Arg Asn Asp Ser Val Ile Val Asp Thr
440 445 450

Phe His Gly Leu Phe Lys Ser Thr Leu Val Cys Pro Glu Cys Ala
455 460 465

Lys Val Ser Val Thr Phe Asp Pro Phe Cys Tyr Leu Thr Leu Pro
470 475 480

Leu Pro Leu Lys Lys Asp Arg Val Met Glu Val Phe Leu Val Pro
485 490 495

Ala Asp Pro His Cys Arg Pro Thr Gln Tyr Arg Val Thr Val Pro
500 505 510

Leu Met Gly Ala Val Ser Asp Leu Cys Glu Ala Leu Ser Arg Leu
515 520 525

Ser Gly Ile Ala Ala Glu Asn Met Val Val Ala Asp Val Tyr Asn
530 535 540

His Arg Phe His Lys Ile Phe Gln Met Asp Glu Gly Leu Asn His
545 550 555

Ile Met Pro Arg Asp Asp Ile Phe Val Tyr Glu Val Cys Ser Thr
560 565 570

Ser Val Asp Gly Ser Glu Cys Val Thr Leu Pro Val Tyr Phe Arg
575 580 585

Glu Arg Lys Ser Arg Pro Ser Ser Thr Ser Ser Ala Ser Ala Leu
590 595 600

Tyr Gly Gln Pro Leu Leu Leu Ser Val Pro Lys His Lys Leu Thr
605 610 615
Leu


Glu 


Ser 


Leu 


Tyr 
620 




Lys 


Gln


Pro 


Leu 
635 


Pro 


Gly 


Ala 


Cys 


Asn 
650 




Glu 


Glu 


Met 


Glu 
665 


Glu 


Thr 


Glu 


Gly 


Ser 
680 


Glu 


Thr 


Thr 


Gln


Lys 
695 


Leu 


Phe 


Thr 


Phe 


Ser 

710 


Asn 


Ser 


Leu 


Ala 


Ala 
725 


Ser 


Thr 


Leu 


Ala 


Met 
740 


Tyr 


Asp 


Glu 


Gln


Glu 
755 


Leu 


Gln


Pro 


Gln


Lys 
770 


Cys 


Ile


Glu 


Leu 


Phe 
785 


Pro 


Trp 


Tyr 


Cys 


Pro 
800 


Lys 


Phe 


Asp 


Leu 


Trp 
815 


Lys 


Arg 


Phe 


Ser 


Tyr 
830 


Val 


Val 


Glu 


Phe 


Pro 
845 


Cys 


Asn 


Leu 


Ser 


Ala 
860 


Ser 


Asn 


His 


Tyr 


Gly 
875 



<210> 35 
<211> 4374 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature
<223> Incyte ID No: 

<400> 35 



Met 


Lys 


Val 


Asp 


Arg 


1 








5 


Pro 


Ala 


Asp 


Cys 


Arg 
20 


Asp 


Glu 


Gln


Leu 


Leu 
35 


Ile


Gly 


Lys 


Cys 


Glu 
50 


Phe 


Asp 


Gly 


Ile


Leu 
65 


Ser 


Trp 


Met 


Leu 


Val 
80 


Met 


Leu 


Leu 


Leu 


Ala 
95 


Tyr 


Ser 


Phe 


Ser 


Arg 
110 



Gln


TV T _ 


Val 


Cys 


Asp 
625 


Pro 


Asp 


Glu 


Phe


Gly 
640 


Gly 


Ser 


Arg 


Asn 


Ser
655 


His 


Gln


Glu 


Glu 


Gly 
670 


Gly 


Glu 


Asp 


Glu 


Pro 
685 


Lys 


Ile


Lys 


Gly Gln










700 


Leu 


Val 


Asn 


Ser 


Tyr 
715 


Asp 


Gly 


Lys 


Leu 


Leu 
730 


Asp 


Trp 


Asp 


Ser 


Glu 
745 


Ser 


Glu 


Ala 


Tyr 


Glu 
760 


Lys 


Lys 


Lys 


Thr 


Thr 
775 


Thr 


Thr 


Met 


Glu 


Thr 
790 


Asn 


Cys 


Lys 


Lys 


His 
805 


Ser 


Leu 


Pro 


Lys 


Ile
820 


Asn 


Arg 


Tyr 


Trp 


Arg 
835 


Ile


Arg 


Gly Leu 


Asn 










850 


Arg 


Pro 


Tyr 


Val 


Tyr 
865 


Ala 


Met 


Gly Val 


Gly 



880 



7507771CD1 



Thr 


Lys 


Leu 


Lys 


Lys 
10 


Ala 


Leu 


Ile


Asp 


Lys 
25 


Leu 


Glu 


Leu 


Gln


Gln
40 


Leu 


Tyr 


His 


Trp 


Val 
55 


Ala 


Asp 


Ala 


Gly Gln










70 


Cys 


Asp 


Arg 


Pro 


Glu 
85 


Val 


Leu 


Asn 


Phe 


Thr 

100 


His 


Leu 


Tyr 


Ser 


Ser 
115 



Arg 


Ile


Ser Arg 


Tyr 












Ser 


Ser 


Pro 


Leu 


Glu 












Cys 


Glu 


Gly Glu 


Asp 










660 


Lys 


Glu 


Gln


Leu 


Ser 










675 


Gly Asn Asp 


Pro 


Ser 










690 


Pro 


Cys 


Pro 


Lys 


Arg 










705 


Gly Thr Ala 


Asp 


Ile










720 


Lys 


Leu 


Asn 


Ser 


Arg 










735 


Thr 


Arg 


Arg 


Leu 


Tyr 










750 


Lys 


His 


Val 


Ser 


Met 










765 


Val 


Ala 


Leu 


Arg 


Asp 










780 


Leu 


Gly Glu 


His 


Asp 










795 


Gln


Gln


Ala 


Thr 


Lys 










810 


Leu 


Val 


Val 


His 


Leu 










825 


Asp 


Lys 


Leu 


Asp 


Thr 










840 


Met 


Ser 


Glu 


Phe 


Val 










855 


Asp 


Leu 


Ile


Ala 


Val 










870 


His 


Tyr 









Thr 


Pro 


Thr 


Glu 


Ala 
15 


Leu 


Lys 


Val 


Cys 


Asn 
30 


Ile


Lys 


Thr 


Trp 


Asn 
45 


Asp 


Leu 


Leu 


Asp 


Arg 
60 


Thr 


Val 


Glu 


Asn 


Met 
75 


Arg 


Glu 


Gln


Leu 


Lys 
90 


Ala 


Leu 


Leu 


Ile


Glu 
105 


Ile


Glu 


His 


Leu 


Thr 
120 
Thr


Leu 


Leu 


Ala 


Ser 


Ser Asp Met 


Gln


Val 


Val 


Leu 


Ala 


Val 


Leu 










125 










130 










135 


Asn Leu Leu Tyr Val 


Phe 


Ser 


Lys 


Arg 


Ser 


Asn 


Tyr 


Ile


Thr 


Arg 










140 










145 










150 


Leu 


Gly 


Ser Asp Lys Arg 


Thr 


Pro 


Leu 


Leu 


Thr 


Arg 


Leu 


Gln


His 








155 










160 










165 


Leu 


Ala 


Glu 


Ser Trp Gly 


Gly Lys 


Glu 


Asn 


Gly 


Phe Gly Leu Ala 










170 










175 










180 


Glu 


Cys 


Cys 


Arg 


Asp 


Leu 


His 


Met 


Met 


Lys 


Tyr 


Pro 


Pro 


Ser 


Ala 






185 










190 










195 


Thr 


Thr 


Leu 


His 


Phe 


Glu 


Phe 


Tyr 


Ala 


Asp 


Pro 


Gly Ala 


Glu 


Val 










200 










205 










210 


Lys 


Ile


Glu 


Lys 


Arg 


Thr 


Thr 


Ser 


Asn 


Thr 


Leu 


His 


Tyr 


Ile


His 






215 










220 










225 


Ile


Glu 


Gln


Leu 


Asp 


Lys 


Ile


Ser 


Glu 


Ser 


Pro 


Ser 


Glu 


Ile


Met 










230 










235 










240 


Glu 


Ser 


Leu 


Thr 


Lys 


Met 


Tyr 


Ser 


Thr 


Pro 


Lys 


Asp 


Lys 


Gln


Met 










245 










250 










255 


Leu 


Leu 


Phe 


Thr 


His 


Ile


Arg 


Leu 


Ala 


His 


Gly 


Phe 


Ser 


Asn 


His 










260 










265 










270 


Arg 


Lys 


Arg 


Leu 


Gln


Ala 


Val 


Gln


Ala 


Arg 


Leu 


His 


Ala 


Ile


Ser 




275 










280 










285 


Ile


Leu 


Val 


Tyr 


Ser 


Asn 


Ala 


Leu 


Gln


Glu 


Ser 


Ala 


Asn 


Ser 


Ile








290 










295 










300 


Leu Tyr Asn Gly Leu 


Ile


Glu 


Glu 


Leu 


Val 


Asp 


Val 


Leu 


Gln


Ile










305 










310 










315 


Thr 


Asp 


Lys 


Gln


Leu 


Met 


Glu 


Ile


Lys 


Ala 


Ala 


Ser 


Leu 


Arg 


Thr 






320 










325 










330 


Leu 


Thr 


Ser 


Ile


Val 


His 


Leu 


Glu 


Arg 


Thr 


Pro 


Lys 


Leu 


Ser 


Ser 










335 










340 










345 


Ile


Ile


Asp 


Cys 


Thr 


Gly Thr Ala 


Ser 


Tyr 


His 


Gly 


Phe 


Leu 


Pro 






350 










355 










360 


Val 


Leu 


Val 


Arg 


Asn 


Cys 


Ile


Gln


Ala 


Met 


Ile


Asp 


Pro 


Ser 


Met 










365 










370 










375 


Asp 


Pro 


Tyr 


Pro 


His 


Gln


Phe 


Ala 


Thr 


Ala 


Leu 


Phe 


Ser 


Phe 


Leu 






380 










385 










390 


Tyr 


His 


Leu 


Ala 


Ser 


Tyr 


Asp 


Ala 


Gly Gly Glu Ala 


Leu 


Val 


Ser 








395 










400 










405 


Cys 


Gly Met 


Met 


Glu 


Ala 


Leu 


Leu 


Lys 


Val 


He 


Lys 


Phe 


Leu 


Gly 








410 










415 










420 


Asp 


Glu 


Gin 


Asp 


Gin 


He 


Thr 


Phe 


Val 


Thr 


Arg 


Ala 


Val 


Arg 


Val 






425 










430 










435 


Val 


Asp 


Leu 


He 


Thr 


Asn 


Leu 


Asp 


Met 


Ala 


Ala 


Phe 


Gin 


Ser 


His 








440 










445 










450 


Ser 


Gly 


Leu 


Ser 


He 


Phe 


He 


Tyr 


Arg 


Leu 


Glu 


His 


Glu 


Val 


Asp 








455 










460 










465 


Leu 


Cys 


Arg 


Lys 


Glu 


Cys 


Pro 


Phe 


Val 


He 


Lys 


Pro 


Lys 


He 


Gin 








470 










475 










480 


Arg 


Pro 


Asn 


Thr 


Thr 


Gin 


Glu 


Gly 


Glu 


Glu 


Met 


Glu 


Thr 


Asp 


Met 








485 










490 










495 


Asp 


Gly Val 


Gin 


Cys 


He 


Pro 


Gin 


Arg 


Ala 


Ala 


Leu 


Leu 


Lys 


Ser 








500 










505 










510 


Met 


Leu 


Asn 


Phe 


Leu 


Lys 


Lys 


Ala 


He 


Gin 


Asp 


Pro 


Ala 


Phe 


Ser 










515 










520 










525 


Asp Gly Ile Arg His Val Met Asp Gly Ser


Leu 


Pro 


Thr 


Ser 


Leu 










530 










535 










540 


Lys 


His 


He 


He 


Ser 


Asn 


Ala 


Glu 


Tyr 


Tyr 


Gly 


Pro 


Ser 


Leu 


Phe 








545 










550 










555 


Leu 


Leu 


Ala 


Thr 


Glu 


Val 


Val 


Thr 


Val 


Phe 


Val 


Phe 


Gin 


Glu 


Pro 








560 










565 










570 


Ser 


Leu 


Leu 


Ser 


Ser 


Leu Gln Asp Asn Gly Leu


Thr 


Asp 


Val 


Met 










575 










580 










585 


Leu 


His 


Ala 


Leu 


Leu 


He 


Lys 


Asp Val 


Pro 


Ala 


Thr 


Arg 


Glu 


Val 
590










595 










600 


Leu 


Gly 


Ser 


Leu 


Pro 


Asn 


Val 


Phe 


Ser 


Ala 


Leu 


Cys 


Leu 


Asn 


Ala 








605 










610 










615 


Arg Gly Leu 


Gin 


Ser 


Phe 


Val 


Gin 


Cys 


Gin 


Pro 


Phe 


Glu 


Arg 


Leu 










620 










625 










630 


Phe 


Lys 


Val 


Leu 


Leu 


Ser 


Pro 


Asp 


Tyr 


Leu 


Pro 


Ala 


Met 


Arg 


Arg 








635 










640 










645 


Arg 


Arg 


Ser 


Ser 


Asp 


Pro Leu Gly Asp Thr Ala 


Ser Asn Leu Gly 






650 










655 










660 


Ser 


Ala 


Val 


Asp 


Glu 


Leu 


Met 


Arg 


His 


Gin 


Pro 


Thr 


Leu Lys 


Thr 








665 










670 










675 


Asp 


Ala 


Thr 


Thr 


Ala 


He 


He 


Lys 


Leu 


Leu 


Glu 


Glu 


He 


Cys 


Asn 








680 










685 










690 


Leu Gly Arg Asp 


Pro 


Lys 


Tyr 


He 


Cys 


Gin 


Lys 


Pro 


Ser 


He 


Gin 










695 










700 










705 


Lys 


Ala 


Asp 


Gly Thr 


Ala 


Thr 


Ala 


Pro 


Pro 


Pro 


Arg 


Ser 


Asn 


His 






710 










715 










720 


Ala 


Ala 


Glu 


Glu 


Ala 
725 


Ser 


Ser 


Glu 


Asp 


Glu 
730 


Glu 


Glu 


Glu 


Glu 


Val 
735 


Gin 


Ala 


Met 


Gin 


Ser 
740 


Phe 


Asn 


Ser 


Thr 


Gin 
745 


Gin 


Asn 


Glu 


Thr 


Glu 
750 


Pro 


Asn 


Gin 


Gin 


Val 
755 


Val 


Gly 


Thr 


Glu 


Glu 
760 


Arg 


He 


Pro 


He 


Pro 
765 


Leu 


Met 


Asp 


Tyr 


He 


Leu 


Asn 


Val 


Met 


Lys 


Phe 


Val 


Glu 


Ser 


He 






770 










775 










780 


Leu 


Ser 


Asn 


Asn 


Thr 
785 


Thr 


Asp 


Asp 


His 


Cys 
790 


Gin 


Glu 


Phe 


Val 


Asn 
795 


Gin 


Lys 


Gly


Leu 


Leu 


Pro 


Leu 


Val 


Thr 


He 


Leu 


Gly 


Leu 


Pro 


Asn 






800 










805 










810 


Leu 


Pro 


He 


Asp 


Phe 


Pro 


Thr 


Ser 


Ala 


Ala 


Cys 


Gin 


Ala 


Val 


Ala 








815 










820 










825 


Gly Val 


Cys 


Lys 


Ser 


He 


Leu 


Thr 


Leu 


Ser 


His 


Glu 


Pro 


Lys 


Val 






830 










835 










840 


Leu 


Gin 


Glu 


Gly Leu 


Leu 


Gin 


Leu 


Asp 


Ser 


He 


Leu 


Ser 


Ser 


Leu 










845 










850 










855 


Glu 


Pro 


Leu 


His 


Arg 
860 


Pro 


He 


Glu 


Ser 


Pro 
865 


Gly 


Gly 


Ser 


Val 


Leu 
870 


Leu 


Arg 


Glu 


Leu 


Ala 


Cys 


Ala 


Gly Asn Val 


Ala 


Asp 


Ala 


Thr 


Leu 








875 










880 










885 


Ser 


Ala 


Gin 


Ala 


Thr 
890 


Pro 


Leu 


Leu 


His 


Ala 
895 


Leu 


Thr 


Ala 


Ala 


His 
900 


Ala 


Tyr 


He 


Met 


Met 


Phe 


Val 


His 


Thr 


Cys 


Arg 


Val 


Gly Gin 


Ser 








905 










910 










915 


Glu 


He 


Arg 


Ser 


He 


Ser 


Val 


Asn 


Gin 


Trp 


Gly 


Ser 


Gin 


Leu 


Gly 








920 










925 










930 


Leu 


Ser 


Val 


Leu 


Ser 


Lys 


Leu 


Ser 


Gin 


Leu 


Tyr 


Cys 


Ser 


Leu 


Val 










935 








940 










945 


Trp 


Glu 


Ser 


Thr 


Val 


Leu 


Leu 


Ser 


Leu 


Cys 


Thr 


Pro 


Asn 


Ser 


Leu 








950 










955 










960 


Pro 


Ser 


Gly 


Cys 


Glu 


Phe Gly Gln Ala


Asp 


Met 


Gin 


Lys 


Leu 


Val 








965 










970 










975 


Pro 


Lys 


Asp Glu Lys Ala Gly Thr Thr Gln Gly Gly Lys Arg


Ser 








980 










985 










990 


Asp Gly Glu Gln Asp Gly Ala


Ala 


Gly 


Ser 


Met 


Asp 


Ala 


Ser 


Thr 










995 










1000 










1005 


Gin 


Gly Leu 


Leu 


Glu 


Gly Ile


Gly Leu Asp Gly Asp Thr Leu 


Ala 










1010 










1015 










1020 


Pro 


Met 


Glu 


Thr 


Asp 

1025 


Glu 


Pro 


Thr 


Ala 


Ser 
1030 


Asp 


Ser 


Lys 


Gly 


Lys 
1035 


Ser 


Lys 


He 


Thr 


Pro 


Ala 


Met 


Ala 


Ala 


Arg 


He 


Lys 


Gin 


He 


Lys 






1040 










1045 










1050 


Pro 


Leu 


Leu 


Ser 


Ala 


Ser 


Ser 


Arg 


Leu 


Gly Arg 


Ala 


Leu 


Ala 


Glu 










1055 










1060 










1065 
Leu


Phe 


Gly 


Leu Leu 








1070 


Gin 


Arg 


Arg 


Ser His 








1085 


Pro 


Ala 


Ala 


Arg Ser 








1100 


Lys 


Gly 


Leu 


Ser Trp 








1115 


Phe 


Arg 


Leu 


Thr Phe 








1130 


Met 


Leu 


Phe 


Asp Glu 








1145 


Lys 


Phe 


Leu 


Cys Ser 








1160 


Asn 


Trp 


Ala 


Leu Ser 








1175 


Leu 


Glu 


His 


Ser Asp 








1190 


Ala 


Trp 


Leu 


Met Leu 








1205 


Leu 


Glu 


Ser 


Pro His 








1220 


Gin 


Asn 


Phe 


Pro Gin 








1235 


Gin 


Lys 


Ala 


Ala Phe 








1250 


Pro 


Leu 


Lys 


Val Tyr 








1265 


Ile


Leu 


Cys 


His Ile








1280 


Leu 


Ser 


Lys 


Glu Lys 








1295 


Gin 


Glu 


Glu 


Gly Gly 








1310 


Gin 


Leu 


Gin 


Gin Leu 








1325 


Met 


Glu 


Ala 


Leu Leu 








1340 


Tyr 


Leu 


Leu 


Thr His 








1355 


Asp 


Leu 


Ser 


Met Ser 








1370 


Met 


Ser 


Leu 


Gly Gin 








1385 


Pro 


Glu 


Glu 


Val Ala 








1400 


Arg 


Glu 


Lys 


Gin Glu 








1415 


Gin 


Asp 


Ala 


Asp Pro 








1430 


Asp 


Thr 


Met 


Leu Pro 








1445 


Asp 


Thr 


Val 


Tyr Arg 








1460 


Arg 


Asn 


Gly 


Ala Asp 








1475 


Asn 


Gin 


Val 


Trp Glu 








1490 


Pro 


Leu 


Thr 


Thr Ser 








1505 


Gin 


Met 


Ala 


Thr Leu 








1520 


Leu 


Leu 


Leu 


Thr Leu 



Val 


Lys 


Leu 


Cys Val 








1075 


His 


Ala 


Ala 


Ser Thr 








1090 


Thr 


Ala 


Ser 


Ala Leu 








1105 


Gin 


Pro 


Pro 


Pro Tyr 








1120 


Phe 


Ile


Cys 


Ser Val 








1135 


Arg 


Lys 


Tyr 


Pro Tyr 








1150 


Gly 


Gly 


His 


Asn Ala 








1165 


Met 


Gly 


Gly 


Lys Val 








1180 


Leu 


Pro 


Asp 


Gly Thr 








1195 


Val 


Glu 


Lys 


Met Val 








1210 


Ser 


Leu 


Pro 


Ala Lys 








1225 


Phe 


Ser 


Ala 


Leu Arg 








1240 


Thr 


Cys 


He 


Lys Asn 








1255 


Gly 


Gly 


Arg 


Met Ala 








1270 


Leu 


Arg 


Gly 


Glu Pro 








1285 


Glu 


Gly 


Ser 


Arg Gly 








1300 


Ser 


Arg 


Arg 


Glu Pro 








1315 


Met 


Asp 


Met 


Gly Phe 








1330 


Asn 


Thr 


Ser 


Thr Met 








1345 


Pro 


Pro 


Pro 


He Met 








1360 


Glu 


Glu 


Asp 


Gin Met 








1375 


Asp 


He 


Pro 


Met Asp 








1390 


Cys 


Arg 


Lys 


Glu Glu 








1405 


Glu 


Glu 


Glu 


Ala Lys 








1420 


Leu 


Glu 


Gin 


Asp Glu 








1435 


Gly 


Cys 


Phe 


His Leu 








1450 


Val 


Cys 


Asp 


Leu He 








1465 


Tyr 


Arg 


Asp 


Met He 








1480 


Ala 


Ala 


Asp 


Val Leu 








1495 


Asp 


Thr 


Lys 


Thr Val 








1510 


Pro 


Gin 


Ala 


Ser Asn 








1525 


Leu 


Phe 


Glu 


Glu Leu 



Gly 


Ser 


Pro 


Val Arg 








1080 


Thr 


Thr 


Ala 


Pro Thr 








1095 


Thr 


Lys 


Leu 


Leu Thr 








1110 


Thr 


Pro 


Thr 


Pro Arg 








1125 


Gly 


Phe 


Thr 


Ser Pro 








1140


His 


Leu 


Met 


Leu Gin 








1155 


Leu 


Phe 


Glu 


Thr Phe 








1170 


Pro 


Val 


Ser 


Glu Gly 








1185 


Gly 


Glu 


Phe 


Leu Asp 








1200 


Asn 


Pro 


Thr 


Thr Val 








1215 


Leu 


Pro 


Gly 


Gly Val 








1230 


Phe 


Leu 


Val 


Val Thr 








1245 


Leu 


Trp 


Asn 


Arg Lys 








1260 


Glu 


Ser 


Met 


Leu Ala 








1275 


Val 


He 


Arg 


Glu Arg 








1290 


Glu 


Glu 


Asp 


Thr Gly 








1305 


Gin 


Val 


Asn 


Gin Gin 








1320 


Thr 


Arg 


Glu 


His Ala 








1335 


Glu 


Gin 


Ala 


Thr Glu 








1350 


Gly 


Gly 


Val 


Val Arg 








1365 


Met 


Arg 


Ala 


He Ala 








1380 


Gin 


Arg 


Ala 


Glu Ser 








1395 


Glu 


Glu 


Arg 


Lys Ala 








1410 


Cys 


Leu 


Glu 


Lys Phe 








1425 


Leu 


His 


Thr 


Phe Thr 








1440 


Leu 


Asp 


Glu 


Leu Pro 








1455 


Met 


Thr 


Ala 


He Lys 








1470 


Leu 


Lys 


Gin 


Val Val 








1485 


He 


Lys 


Ala 


Ala Leu 








1500 


Ser 


Glu 


Trp 


He Ser 








1515 


Leu 


Ala 


Thr 


Arg He 








1530 


Lys 


Leu 


Pro 


Cys Ala 
1535 1540 1545

Trp Val Val Glu Ser Ser Gly Ile Leu Asn Val Leu Ile Lys Leu
1550 1555 1560

Leu Glu Val Val Gln Pro Cys Leu Gln Ala Ala Lys Glu Gln Lys
1565 1570 1575

Glu Val Gln Thr Pro Lys Trp Ile Thr Pro Val Leu Leu Leu Ile
1580 1585 1590

Asp Phe Tyr Glu Lys Thr Ala Ile Ser Ser Lys Arg Arg Ala Gln
1595 1600 1605

Met Thr Lys Tyr Leu Gln Ser Asn Ser Asn Asn Trp Arg Trp Phe
1610 1615 1620

Asp Asp Arg Ser Gly Arg Trp Cys Ser Tyr Ser Ala Ser Asn Asn
1625 1630 1635

Ser Thr Ile Asp Ser Ala Trp Lys Ser Gly Glu Thr Ser Val Arg
1640 1645 1650

Phe Thr Ala Gly Arg Arg Arg Tyr Thr Val Gln Phe Thr Thr Met
1655 1660 1665

Val Gln Val Asn Glu Glu Thr Gly Asn Arg Arg Pro Val Met Leu
1670 1675 1680

Thr Leu Leu Arg Val Pro Arg Leu Asn Lys Asn Ser Lys Asn Ser
1685 1690 1695

Asn Gly Gln Glu Leu Glu Lys Thr Leu Glu Glu Ser Lys Glu Met
1700 1705 1710

Asp Ile Lys Arg Lys Glu Asn Lys Gly Asn Asp Thr Pro Leu Ala
1715 1720 1725

Leu Glu Ser Thr Asn Thr Glu Lys Glu Thr Ser Leu Glu Glu Thr
1730 1735 1740

Lys Ile Gly Glu Ile Leu Ile Gln Gly Leu Thr Glu Asp Met Val
1745 1750 1755

Thr Val Leu Ile Arg Ala Cys Val Ser Met Leu Gly Val Pro Val
1760 1765 1770

Asp Pro Asp Thr Leu His Ala Thr Leu Arg Leu Cys Leu Arg Leu
1775 1780 1785

Thr Arg Asp His Lys Tyr Ala Met Met Phe Ala Glu Leu Lys Ser
1790 1795 1800

Thr Arg Met Ile Leu Asn Leu Thr Gln Ser Ser Gly Phe Asn Gly
1805 1810 1815

Phe Thr Pro Leu Val Thr Leu Leu Leu Arg His Ile Ile Glu Asp
1820 1825 1830

Pro Cys Thr Leu Arg His Thr Met Glu Lys Val Val Arg Ser Ala
1835 1840 1845

Ala Thr Ser Gly Ala Gly Ser Thr Thr Ser Gly Val Val Ser Gly
1850 1855 1860

Ser Leu Gly Ser Arg Glu Ile Asn Tyr Ile Leu Arg Val Leu Gly
1865 1870 1875

Pro Ala Ala Cys Arg Asn Pro Asp Ile Phe Thr Glu Val Ala Asn
1880 1885 1890

Cys Cys Ile Arg Ile Ala Leu Pro Ala Pro Arg Gly Ser Gly Thr
1895 1900 1905

Ala Ser Asp Asp Glu Phe Glu Asn Leu Arg Ile Lys Gly Pro Asn
1910 1915 1920

Ala Val Gln Leu Val Lys Thr Thr Pro Leu Lys Pro Ser Pro Leu
1925 1930 1935

Pro Val Ile Pro Asp Thr Ile Lys Glu Val Ile Tyr Asp Met Leu
1940 1945 1950

Asn Ala Leu Ala Ala Tyr His Ala Pro Glu Glu Ala Asp Lys Ser
1955 1960 1965

Asp Pro Lys Pro Gly Val Met Thr Gln Glu Val Gly Gln Leu Leu
1970 1975 1980

Gln Asp Met Gly Asp Asp Val Tyr Gln Gln Tyr Arg Ser Leu Thr
1985 1990 1995

Arg Gln Ser Ser Asp Phe Asp Thr Gln Ser Gly Phe Ser Ile Asn
2000 2005 2010
Ser


Gin 


Val 


Phe Ala 








2015 


Ser 


Gly Thr 


Ser Gin 








2030 


Asp 


Gly Lys 


Lys Asp 








2045 


Lys


Gin 


Lys 


Gly Lys 








2060 


Ile


Leu 


Arg 


Leu Leu 








2075 


Ala 


Thr 


Leu 


He Ala 








2090 


Leu 


He 


Lys 


Glu Asp 








2105 


Leu 


Leu 


Pro 


His Thr 








2120 


Leu 


Ala 


Arg 


Leu Phe 








2135 


Thr 


Asp 


Ala 


Gin Val 








2150 


Gly 


Arg 


Ala 


Leu Ala 








2165 


Leu 


Gin 


Ala 


Val Met 








2180 


Pro 


Ser 


Thr 


Ser Ser 








2195 


His 


Asn 


Gly 


Met Asn 








2210 


Leu 


Val 


Asn 


Asp Leu 








2225 


Ser 


Pro 


Asn 


Met Ala 








2240 


Glu 


Thr 


Leu 


Ser Arg 








2255 


Ser 


Lys 


Ser 


Ala Ser 








2270 


Gly 


Ala 


Ser 


Gln Asp








2285 


Pro 


Gly Glu 


Ala Glu 








2300 


Thr 


Glu 


Val 


Ala Asp 








2315 


Asp 


Ser 


Val 


Val Ile








2330 


Glu 


Met 


Gln


Val Glu 








2345 


Leu 


Glu 


Arg 


Asp Gly 








2360 


Arg 


Ser 


Gly 


Glu Asp 








2375 


Ala 


Pro 


Ser 


Asn Leu 








2390 


Glu 


Asp 


Ser 


Met Asn 








2405 


Thr 


Gln


Glu 


Glu Asp 








2420 


Ser 


Gln


Asp 


Glu Glu 








2435 


Gln


Glu 


Asp 


Asp Glu 








2450 


Asp 


Gly 


Ser 


Glu Met 








2465 


Ala 


Ser 


Pro 


Leu Val 



Ala 


Asp 


Gly 


Ala Ser 








2020 


Gly 


Glu 


Ala 


Ser Thr 








2035 


Lys 


Glu 


Gly 


Asp Arg 








2050 


Gly 


Ser 


Lys 


Pro Leu 








2065 


Ala 


Glu 


Leu 


Val Arg 








2080 


Asn 


Tyr 


Ser 


Tyr Thr 








2095 


Cys 


Ser 


Val 


Leu Ala 








2110 


Gln


Asn 


Ala 


Glu Asp 








2125 


Leu 


Ala 


Ser 


Leu Ala 








2140 


Ala 


Leu 


Val 


Asn Glu 








2155 


Met 


Ala 


Glu 


Ser Thr 








2170 


Cys 


Ile


Ile


Ser Thr 








2185 


Phe 


Tyr 


Ser 


Ser Ala 








2200 


Asn 


Ile


Ile


Arg Leu 








2215 


Ala 


Arg 


Val 


Pro His 








2230 


Asn 


Thr 


Val 


Asn Ala 








2245 


Ile


Val 


Asn 


Gln Pro








2260 


Ser 


Lys 


Asn 


Lys Ser 








2275 


Ser 


Ser 


Ser 


Asn Gln








2290 


Val 


Gln


Glu 


Glu Asp 








2305 


Gly 


Asp 


Ile


Met Asp 








2320 


Ala 


Gly 


Gln


Pro Glu 








2335 


Asn 


Glu 


Leu 


Glu Asp 








2350 


Gly 


Ser 


Gly 


Asn Ser 








2365 


Glu 


Ser 


Gln


Glu Asp 








2380 


Ser 


Gln


Ala 


Ser Thr 








2395 


Ile


Leu 


Asp 


Pro Glu 








2410 


Ser 


Ser 


Gly 


Ser Asn 








2425 


Glu 


Glu 


Glu 


Glu Glu 








2440 


Gly 


Glu 


Glu 


Gly Asp 








2455 


Glu 


Leu 


Asp 


Glu Asp 








2470 


Arg 


Phe 


Glu 


Arg Phe 



Thr 


Glu 


Thr 


Ser Ala 








2025 


Pro 


Glu 


Glu 


Ser Arg 








2040 


Ala 


Ser 


Glu 


Glu Gly 








2055 


Met 


Pro 


Thr 


Ser Thr 








2070 


Ser 


Tyr 


Val 


Gly Ile








2085 


Val 


Gly 


Gln


Ser Glu 








2100 


Phe 


Val 


Leu 


Asp His 








2115 


Lys 


Asp 


Thr 


Pro Ala 








2130 


Ala 


Ala 


Gly 


Ser Gly 








2145 


Val 


Lys 


Ala 


Ala Leu 








2160 


Glu 


Lys 


His 


Ala Arg 








2175 


Ile


Met 


Glu 


Ser Cys 








2190 


Thr 


Ala 


Lys 


Thr Gln








2205 


Phe 


Leu 


Lys 


Lys Gly 








2220 


Ser 


Leu 


Asp 


Leu Ser 








2235 


Ala 


Leu 


Lys 


Pro Leu 








2250 


Ser 


Ser 


Leu 


Phe Gly 








2265 


Glu 


Gln


Asp 


Ala Gln








2280 


Gln


Asp 


Pro 


Gly Glu 








2295 


His 


Asp 


Val 


Thr Gln








2310 


Gly 


Glu 


Ala 


Glu Thr 








2325 


Val 


Leu 


Ser 


Ser Gln








2340 


Leu 


Ile


Asp 


Glu Leu 








2355 


Thr 


Ile


Ile


Val Ser 








2370 


Val 


Leu 


Met 


Asp Glu 








2385 


Leu 


Gln


Ala 


Asn Arg 








2400 


Asp 


Glu 


Glu 


Glu His 








2415 


Glu 


Asp 


Glu 


Asp Asp 








2430 


Asp 


Glu 


Glu 


Asp Asp 








2445 


Glu 


Asp 


Asp 


Asp Asp 








2460 


Tyr 


Pro 


Asp 


Met Asn 








2475 


Asp 


Arg 


Glu 


Asp Asp 
2480








2485 








2490 


Leu 


Ile


Ile


Glu Phe 


Asp 


Asn 


Met 


Phe Ser 


Ser 


Ala 


Thr 


Asp Ile








2495 






2500 








2505 


Pro 


Pro 


Ser 


Pro Gly Asn 


Ile


Pro 


Thr Thr 


His 


Pro 


Leu 


Met Val 








2510 








2515 








2520 


Arg 


His 


Ala Asp His 


Ser 


Ser 


Leu 


Thr Leu Gly 


Ser 


Gly 


Ser Ser 






2525 








2530 








2535 


Thr 


Thr 


Arg 


Leu Thr 


Gln


Gly 


Ile


Gly Arg 


Ser 


Gln


Arg 


Thr Leu 






2540 








2545 








2550 


Arg 


Gln


Leu 


Thr Ala 


Asn 


Thr Gly His Thr 


Ile


His 


Val 


His Tyr 






2555 








2560 








2565 


Pro 


Gly Asn Arg Gln


Pro 


Asn 


Pro 


Pro Leu 


Ile


Leu 


Gln


Arg Leu 








2570 








2575 








2580 


Leu Gly 


Pro 


Ser Ala 


Ala 


Ala 


Asp 


Ile Leu


Gln


Leu 


Ser 


Ser Ser 








2585 








2590 








2595 


Leu 


Pro 


Leu 


Gln Ser


Arg 


Gly Arg Ala Arg 


Leu 


Leu 


Val 


Gly Asn 








2600 








2605 








2610 


Asp 


Asp 


Val 


His Ile


Ile


Ala 


Arg 


Ser Asp 


Asp 


Glu 


Leu 


Leu Asp 




2615 








2620 








2625 


Asp 


Phe 


Phe 


His Asp 


Gln


Ser 


Thr 


Ala Thr 


Ser 


Gln


Ala 


Gly Thr 






2630 








2635 








2640 


Leu 


Ser 


Ser 


Ile Pro


Thr 


Ala 


Leu 


Thr Arg 


Trp


Thr 


Glu 


Glu Cys 








2645 








2650 








2655 


Lys 


Val 


Leu 


Asp Ala 


Glu 


Ser 


Met 


His Asp 


Cys 


Val 


Ser 


Val Val 






2660 








2665 








2670 


Lys 


Val 


Ser 


Ile Val


Asn 


His 


Leu 


Glu Phe 


Leu 


Arg 


Asp 


Glu Glu 






2675 








2680 








2685 


Leu 


Glu 


Glu 


Arg Arg 


Glu 


Lys 


Arg 


Arg Lys 


Gln


Leu 


Ala 


Glu Glu 








2690 








2695 








2700 


Glu 


Thr 


Lys 


Ile Thr


Asp 


Lys 


Gly 


Lys Glu 


Asp 


Lys 


Glu 


Asn Arg 






2705 








2710 








2715 


Asp 


Gln


Ser 


Ala Gln


Cys 


Thr 


Ala 


Ser Lys 


Ser 


Asn 


Asp 


Ser Thr 






2720 








2725 








2730 


Glu 


Gln


Asn 


Leu Ser 


Asp 


Gly Thr 


Pro Met 


Pro 


Asp 


Ser 


Tyr Pro 








2735 






2740 








2745 


Thr 


Thr 


Pro 


Ser Ser 


Thr 


Asp 


Ala 


Ala Thr 


Ser 


Glu 


Ser 


Lys Glu 








2750 








2755 








2760 


Thr


Leu 


Gly Thr Leu 


Gln


Ser 


Ser 


Gln Gln


Gln


Pro 


Thr 


Leu Pro 








2765 








2770 








2775 


Thr 


Pro 


Pro 


Ala Leu 


Gly 


Glu 


Val 


Pro Gln


Glu 


Leu 


Gln


Ser Pro 








2780 






2785 








2790 


Ala 


Gly Glu Gly Gly Ser 


Ser 


Thr 


Gln Leu


Leu 


Met 


Pro 


Val Glu 








2795 








2800 








2805 


Pro 


Glu 


Glu 


Leu Gly 


Pro 


Thr 


Arg 


Pro Ser 


Gly Glu Ala 


Glu Thr 








2810 








2815 








2820 


Thr 


Gln


Met 


Glu Leu 


Ser 


Pro 


Ala 


Pro Thr 


Ile


Thr 


Ser 


Leu Ser 








2825 








2830 








2835 


Pro 


Glu Arg Ala Glu 


Asp 


Ser 


Asp 


Ala Leu 


Thr 


Ala 


Val 


Ser Ser 








2840 








2845 








2850 


Gln


Leu Glu Gly Ser 


Pro 


Met 


Asp 


Thr Ser 


Ser 


Leu 


Ala 


Ser Cys 








2855 








2860 








2865 


Thr 


Leu 


Glu 


Glu Ala 


Val 


Gly Asp 


Thr Ser 


Ala 


Ala 


Gly 


Ser Ser 








2870 








2875 








2880 


Glu 


Gln


Pro 


Arg Ala 


Gly 


Ser 


Ser 


Thr Pro Gly Asp Ala 


Pro Pro 








2885 








2890 








2895 


Ala 


Val 


Ala 


Glu Val 


Gln


Gly Arg 


Ser Asp 


Gly 


Ser 


Gly Glu Ser 








2900 








2905 








2910 


Ala 


Gln


Pro 


Pro Glu 


Asp 


Ser 


Ser 


Pro Pro 


Ala 


Ser 


Ser 


Glu Ser 








2915 








2920 








2925 


Ser 


Ser 


Thr 


Arg Asp 


Ser 


Ala 


Val 


Ala Ile


Ser 


Gly 


Ala 


Asp Ser 








2930 








2935 








2940 


Ara Gly Ile


Leu Glu 


Glu 


Pro 


Leu 


Pro Ser 


Thr 


Ser 


Ser 


Glu Glu 








2945 








2950 








2955 
Glu


Asp 


Pro 


Leu Ala 








2960 


Ser 


Phe 


Leu 


Ala Ala 








2975 


Gln


Asn 


Gln


Leu Gly 








2990 


Thr 


Asn 


Ser 


Ser Ala 








3005 


Glu 


Val 


Ser 


Pro Glu 








3020 


Glu 


Glu 


Val 


Leu Ala 








3035 


Leu 


Ala 


Gln


Asn Ala 








3050 


Phe 


Ile


Gln


Thr Leu 








3065 


Asp 


Met 


Glu 


Asp Ser 








3080 


Ala 


Glu 


Ala 


Gln Ala








3095 


Gln


Leu 


Met 


His Glu 








3110 


Leu 


Ser 


Ala 


Ile Leu








3125 


Gly 


Asn 


Arg 


Gly Val 








3140 


Gly 


Thr 


Phe 


Gln Met








3155 


Gly 


Ser 


Asn 


Val Asp 








3170 


Leu 


Asp 


His 


Glu Ala 








3185 


Asp 


Glu 


Pro 


Lys Leu 








3200 


Asn 


Leu 


Cys 


Tyr His 








3215 


Leu 


Leu 


Ser 


Ile Leu








3230 


Glu 


Thr 


Pro 


Lys Leu 








3245 


Ser 


Lys 


Ser 


Cys Gly 








3260 


Leu 


Leu 


His 


Lys Met 








3275 


Leu 


Ser 


Val 


Ser Met 








3290 


Phe 


Gln


Ile


Gln Arg








3305 


Ala 


Ser Gly 


Gly Ser 








3320 


Val 


Val 


Cys 


Arg His 








3335 


Val 


Phe 


Pro 


Ser His 








3350 


Cys 


Glu 


Ser 


Asp Arg 








3365 


Ser 


Ser 


Gln


Ser Ser 








3380 


Leu 


Leu 


Val 


Lys Leu 








3395 


Asn 


Ser 


Val 


Lys Ser 








3410 


Thr 


Ser 


Pro 


Tyr Ser 



Gly 


Ile


Ser 


Leu Pro 








2965 


Leu 


Pro 


Asp 


Asp Ile








2980 


Ile


Arg 


Pro 


Pro Thr 








2995 


Pro 


Ala 


Val 


Val Gly 








3010 


Phe 


Leu 


Ala 


Ala Leu 








3025 


Gln


Gln


Arg 


Ala Glu 








3040 


Ser 


Ser 


Asp 


Thr Pro 








3055 


Pro 


Ser 


Asp 


Leu Arg 








3070 


Val 


Leu 


Ala 


Val Met 








3085 


Leu 


Arg 


Arg 


Glu Gln








3100 


Arg 


Leu 


Phe 


Gly His 








3115 


Arg 


Ser 


Pro 


Ala Phe 








3130 


Gln


Tyr 


Thr 


Arg Leu 








3145 


Gly 


Gly 


Ser 


Ser Ser 








3160 


Thr 


Leu 


Leu 


Arg Leu 








3175 


Leu 


Ser 


Cys 


Leu Leu 








3190 


Asn 


Thr 


Ser 


Arg Leu 








3205 


Ala 


Gln


Thr 


Arg His 








3220 


Gln


Arg 


Ser 


Ser Glu 








3235 


Thr 


Thr 


Ser 


Glu Glu 








3250 


Ser 


Ser 


Ser 


His Glu 








3265 


Glu 


Ser 


Lys 


Ser Ser 








3280 


Asp 


Ala 


Ala 


Leu Gly 








3295 


Ser 


Gly 


Gly 


Arg Lys 








3310 


Thr 


Val 


His 


Ile His








3325 


Val 


Leu 


Asp 


Thr Leu 








3340 


Phe 


Thr 


Gln


Gln Arg








3355 


Glu 


Arg 


Gly 


Asn Lys 








3370 


Ser 


Ser 


Gly 


Ile Cys








3385 


Asp 


Asn 


Met 


Asn Val 








3400 


Val 


Pro 


Val 


Ser Ala 








3415 


Leu 


Glu 


Ala 


Ser Pro 



Glu 


Gly Val 


Asp Pro 








2970 


Arg 


Arg 


Glu 


Val Leu 
2985 


Arg 


Thr 


Ala 


Pro Ser 
3000 


Asn 


Pro 


Gly 


Val Thr 
3015 


Pro 


Pro 


Ala 


Ile Gln
3030 


Gln


Gln


Arg 


Arg Glu 

3045 


Met 


Asp 


Pro 


Val Thr 
3060 


Arg 


Ser 


Val 


Leu Glu 
3075 


Pro 


Pro 


Asp 


Ile Ala
3090 


Glu 


Ala 


Arg 


Gln Arg
3105 


Ser 


Ser 


Thr 


Ser Ala 
3120 


Thr 


Ser 


Arg 


Leu Ser 
3135 


Ala 


Val 


Gln


Arg Gly 
3150 


His 


Asn 


Arg 


Pro Ser 
3165 


Arg 


Gly Arg 


Leu Leu 








3180 


Val 


Leu 


Leu 


Phe Val 
3195 


His 


Arg Val 


Leu Arg 








3210 


Trp 


Val 


Ile


Arg Ser 

3225 


Ser 


Glu 


Leu 


Cys Ile
3240 


Lys 


Gly Lys 


Lys Ser 








3255 


Asn 


Arg 


Pro 


Leu Asp 

3270 


Asn 


Gln


Leu 


Ser Trp 
3285 


Cys 


Arg 


Thr 


Asn Ile
3300 


His 


Thr 


Glu 


Lys His 
3315 


Pro 


Gln


Ala 


Ala Pro 
3330 


Ile


Gln


Leu 


Ala Lys 
3345 


Thr 


Lys 


Glu 


Thr Asn 
3360 


Ala 


Cys 


Ser 


Pro Cys 
3375 


Thr 


Asp 


Phe 


Trp Asp 
3390 


Ser 


Arg 


Lys 


Gly Lys 
3405 


Gly 


Gly Glu 


Gly Glu 








3420 


Leu 


Gly Gln


Leu Met 
3425


Asn 


Met 


Leu 


Ser His 
3440 


Glu 


Lys 


Leu 


Leu Arg 
3455 


Glu 


Asn 


Lys 


Val Ser 
3470 


Ser 


Ser 


Thr 


Thr Thr 
3485 


Thr 


Ala 


Ala 


Ser Thr 
3500 


Val 


Thr 


Ser 


Ala Pro 
3515 


Ile


Val 


Val 


Ala Ala 
3530 


Thr 


Thr 


Thr 


Val Ser 
3545 


Pro 


Ala 


Lys 


Val Ser 
3560 


Met 


Val 


Ser 


Ser Gly 
3575 


Glu 


Val 


Leu 


Thr Ser 
3590 


Ala 


Ala 


Asn 


Val Leu 

3605 


Arg 


Asp 


Thr 


Val Leu 
3620 


Gly Tyr 


Thr 


Leu Cys 








3635 


Arg 


Glu 


Tyr 


Asn Leu 
3650 


Leu 


Ser 


Pro 


Asp Gly 
3665 


Leu 


Lys 


Gly 


Lys Met 
3680 


Val 


Ile


Val 


Ala Ser 
3695 


Gln


Leu 


Pro 


Ser Met 
3710 


Lys 


Phe 


Phe 


Leu Arg 
3725 


Asp Thr 


Arg 


Arg Ala 








3740 


Gly Ser 


Ser 


Gly Leu 








3755 


Arg 


Gln


Leu 


Glu Ala 
3770 


Glu Gly 


Gln


Arg Ala 








3785 


Ser 


Ser 


Gln


Ser Glu 
3800 


Asp Val 


Asp 


Gln Pro








3815 


Ala 


Ser 


Asp 


Gly Thr 
3830 


Pro 


Pro 


Glu 


Leu Pro 
3845 


Leu 


Trp 


Asp 


Met Leu 
3860 


His 


Asp 


Gln


His Ala 
3875 


Phe 


Phe 


Leu 


Val His 
3890 









3430 


Pro 


Val 


Ile


Arg Arg 
3445 


Leu 


Leu 


Ser 


Leu Ile
3460 


Glu 


Ala 


Gln


Ala Asn 
3475 


Ala 


Thr 


Ser 


Thr Thr 
3490 


Thr 


Pro 


Thr 


Pro Pro 
3505 


Ala 


Leu 


Val 


Ala Ala 
3520 


Ser 


Thr 


Thr 


Val Thr 
3535 


Ile


Ser 


Pro 


Thr Thr 
3550 


Asp 


Gly 


Gly 


Ser Ser 
3565 


Leu 


Thr 


Glu 


Asn Gln
3580 


His 


Ser 


Cys 


Ser Glu 
3595 


Leu 


Gln


Leu 


Ser Arg 
3610 


Lys 


Leu 


Leu 


Leu Asn 
3625 


Lys 


Gln


Ile


Gly Thr 
3640 


Glu 


Gln


Gln


Arg Arg 
3655 


Leu 


Pro 


Glu 


Glu Gln
3670 


Gln


Ser 


Arg 


Phe Asp 
3685 


Gln


Lys 


Arg 


Pro Leu 
3700 


Ser 


Met 


Leu 


Thr Ser 
3715 


Val 


Leu 


Gln


Val Ile
3730 


Asn 


Lys 


Lys 


Ala Lys 
3745 


Gly 


Ser 


Ala 


Ser Ser 
3760 


Glu 


Ala 


Asp 


Ala Ile
3775 


Arg 


Arg 


Gln


Gln Gln
3790 


Ala 


Ser 


Val 


Arg Arg 
3805 


Ser 


Pro 


Ser 


Ala Gln
3820 


Pro 


Gln


Gly 


Glu Lys 
3835 


Leu 


Leu 


Ser 


Glu Gln
3850 


Gly Glu 


Cys 


Leu Lys 








3865 


Val 


Leu 


Val 


Leu Gln
3880 


Ala 


Thr 


Glu 


Arg Glu 
3895 









3435 


Ser 


Ser 


Leu 


Leu Thr 








3450 


Ser 


Ile


Ala 


Leu Pro 








3465 


Ser 


Gly 


Ser 


Gly Ala 








3480 


Ser 


Thr 


Thr 


Thr Thr 








3495 


Thr 


Ala 


Pro 


Thr Pro 








3510 


Thr 


Ala 


Ile


Ser Thr 








3525 


Thr 


Pro 


Thr 


Thr Ala 








3540 


Lys 


Gly 


Ser 


Lys Ser 








3555 


Ser 


Thr 


Asp 


Phe Lys 








3570 


Leu 


Gln


Leu 


Ser Val 








3585 


Glu 


Gly 


Leu 


Glu Asp 








3600 


Gly 


Asp 


Ser 


Gly Thr 








3615 


Gly 


Ala 


Arg 


His Leu 








3630 


Leu 


Leu 


Ala 


Glu Leu 








3645 


Ala 


Gln


Cys 


Glu Thr 








3660 


Pro 


Gln


Thr 


Thr Lys 








3675 


Met 


Ala 


Glu 


Asn Val 








3690 


Gly 


Gly 


Arg 


Glu Leu 








3705 


Lys 


Thr 


Ser 


Thr Gln








3720 


Ile


Gln


Leu 


Arg Asp 








3735 


Gln


Thr 


Gly 


Arg Leu 








3750 


Ile


Gln


Ala 


Ala Val 








3765 


Ile


Gln


Met 


Val Arg 








3780 


Ala 


Ala 


Thr 


Ser Glu 








3795 


Glu 


Glu 


Ser 


Pro Met 








3810 


Asp 


Thr 


Gln


Ser Ile








3825 


Glu 


Lys 


Glu 


Glu Arg 








3840 


Leu 


Ser 


Leu 


Asp Glu 








3855 


Glu 


Leu 


Glu 


Glu Ser 








3870 


Pro 


Ala 


Val 


Glu Ala 








3885 


Ser 


Lys 


Pro 


Pro Val 








3900 
Arg


Asp 


Thr 


Arg Glu 


Ser 


Gln


Leu 


Ala His 


Ile


Lys 


Asp 


Glu Pro 




3905 








3910 








3915 


Pro 


Pro 


Leu 


Ser Pro 


Ala 


Pro 


Leu 


Thr Pro 


Ala 


Thr 


Pro 


Ser Ser 








3920 








3925 








3930 


Leu 


Asp 


Pro 


Phe Phe 


Ser 


Arg 


Glu 


Pro Ser 


Ser 


Met 


His 


Ile Ser






3935 








3940 








3945 


Ser 


Ser 


Leu 


Pro Pro 


Asp 


Thr 


Gln


Lys Phe 


Leu 


Arg 


Phe 


Ala Glu 








3950 






3955 








3960 


Thr 


His 


Arg 


Thr Val 


Leu 


Asn 


Gln


Ile Leu


Arg 


Gln


Ser 


Thr Thr 






3965 








3970 








3975 


His 


Leu 


Ala 


Asp Gly 


Pro 


Phe 


Ala 


Val Leu 


Val 


Asp 


Tyr 


Ile Arg








3980 








3985 








3990 


Val 


Leu 


Asp 


Phe Asp 


Val 


Lys 


Arg 


Lys Tyr 


Phe 


Arg 


Gln


Glu Leu 






3995 








4000 








4005 


Glu 


Arg 


Leu 


Asp Glu 


Gly Leu Arg 


Lys Glu 


Asp 


Met 


Ala 


Val His 






4010 








4015 








4020 


Val 


Arg 


Arg 


Asp His 


Val 


Phe 


Glu 


Asp Ser 


Tyr 


Arg 


Glu 


Leu His 




4025 








4030 








4035 


Arg 


Lys 


Ser 


Pro Glu 


Glu 


Met 


Lys 


Asn Arg 


Leu 


Tyr 


Ile


Val Phe 




4040 








4045 








4050 


Glu Gly Glu Glu Gly Gln Asp Ala Gly Gly Leu Leu Arg


Glu Trp 








4055 








4060 








4065 


Tyr 


Met 


Ile


Ile Ser


Arg 


Glu 


Met 


Phe Asn 


Pro 


Met 


Tyr 


Ala Leu 






4070 








4075 








4080 


Phe 


Arg 


Thr 


Ser Pro Gly Asp Arg Val Thr 


Tyr 


Thr 


Ile


Asn Pro 






4085 








4090 








4095 


Ser 


Ser 


His 


Cys Asn


Pro 


Asn 


His 


Leu Ser 


Tyr 


Phe 


Lys 


Phe Val 








4100 








4105 








4110 


Gly Arg 


Ile


Val Ala 


Lys 


Ala 


Val 


Tyr Asp 


Asn 


Arg 


Leu 


Leu Glu 








4115 








4120 








4125 


Cys 


Tyr 


Phe 


Thr Arg 


Ser 


Phe 


Tyr 


Lys His 


Ile


Leu 


Gly 


Lys Ser 




4130 








4135 








4140 


Val 


Arg 


Tyr 


Thr Asp 


Met 


Glu 


Ser 


Glu Asp 


Tyr 


His 


Phe 


Tyr Gln




4145 








4150 








4155 


Gly Leu 


Val 


Tyr Leu 


Leu 


Glu 


Asn 


Asp Val 


Ser 


Thr 


Leu 


Gly Tyr 








4160 








4165 








4170 


Asp 


Leu 


Thr 


Phe Ser 


Thr 


Glu 


Val 


Gln Glu


Phe 


Gly Val 


Cys Glu 






4175 








4180 








4185 


Val 


Arg 


Asp 


Leu Lys 


Pro 


Asn Gly Ala Asn 


Ile


Leu 


Val 


Thr Glu 




4190 








4195 








4200 


Glu 


Asn 


Lys 


Lys Glu 


Tyr 


Val 


His 


Leu Val 


Cys 


Gln


Met 


Arg Met 






4205 








4210 








4215 


Thr Gly Ala 


Ile Arg


Lys 


Gln


Leu 


Ala Ala 


Phe 


Leu 


Glu 


Gly Phe 








4220 








4225 








4230 


Tyr 


Glu 


Ile


Ile Pro


Lys 


Arg 


Leu 


Ile Ser


Ile


Phe 


Thr 


Glu Gln






4235 








4240 








4245 


Glu 


Leu 


Glu 


Leu Leu 


Ile


Ser 


Gly 


Leu Pro 


Thr 


Ile


Asp 


Ile Asp








4250 








4255 








4260 


Asp 


Leu 


Lys 


Ser Asn 


Thr 


Glu 


Tyr 


His Lys 


Tyr 


Gln


Ser 


Asn Ser 




4265 








4270 








4275 


Ile


Gln


Ile


Gln Trp


Phe 


Trp 


Arg 


Ala Leu 


Arg 


Ser 


Phe 


Asp Gln








4280 








4285 








4290 


Ala 


Asp 


Arg 


Ala Lys 


Phe 


Leu 


Gln


Phe Val 


Thr Gly Thr 


Ser Lys 






4295 








4300 








4305 


Val 


Pro 


Leu 


Gln Gly


Phe 


Ala 


Ala 


Leu Glu 


Gly Met 


Asn 


Gly Ile








4310 








4315 










Gln


Lys 


Phe 


Gln Ile


His 


Arg 


Asp 


Asp Arg 


Ser 


Thr 


Asp 


Arg Leu 






4325 








4330 








4335 


Pro 


Ser 


Ala 


His Thr 


Cys 


Phe 


Asn 


Gln Leu


Asp 


Leu 


Pro 


Ala Tyr 








4340 








4345 








4350 


Glu 


Ser 


Phe 


Glu Lys 


Leu 


Arg 


His 


Met Leu 


Leu 


Leu 


Ala 


Ile Gln








4355 








4360 








4365 


Glu 


Cys 


Ser 


Glu Gly 


Phe 


Gly Leu 


Ala 
<220>

<221> misc_feature 
Met


Ala 


Ala 


Trp 


Lys 


Ser 


Trp 


1 








5 






Val 


Val 


Val 


Leu 


Asp 
20 


Met 


Val 


Leu 


Asp 


Glu 


Ser 


Phe 
35 


Lys 


Glu 


Val 


Asp 


Phe 


Tyr 


Ala 
50 


Pro 


Trp 


Pro 


Ile


Trp 


Asn 


Glu 
65 


Val 


Gly 


Pro 


Val 


Lys 


Val 


Gly 
80 


Lys 


Met 



Thr 


Ala 


Leu 
10 


Arg 


Leu 


Cys 


Ala 


Thr 
15 


Val 


Cys 


Lys 
25 


Gly 


Phe 


Val 


Glu 


Asp 
30 


Asn 


Arg 


Asn 
40 


Asp 


Asp 


Ile


Trp 


Leu 
45 


Cys 


Gly 


His 


Cys 


Lys 


Lys 


Leu 


Glu 




55 










60 


Leu 


Glu 


Met 
70 


Lys 


Ser 


Ile


Gly 


Ser 
75 


Asp 


Ala 


Thr 
85 


Ser 


Tyr 


Ser 


Asn 
Met


Pro 


Pro 


Phe 


Leu 


Ile


Thr 


1 








5 






Leu 


Arg 


Ala 


Asn 


Gly His 


Leu 










20 






Thr 


Glu 


Phe 


Ala 


Leu 


His 


Leu 










35 






Asn 


Glu 


Thr 


Asn 


Phe 


Val 


Ile










50 






Leu 


Glu 


Ile


Leu 


Gln


Phe 


Gly 










65 






Leu 


Ala 


Asp 


Ala 


Leu 


Gly 


Tyr 










80 






Asp 


Phe 


Leu 


His 


Ala 


Val 


Tyr 










95 






Gly Thr Glu 


Met 


Glu 


Leu 


Ala 










110 






Thr 


Pro 


Leu 


Ser 


Pro 


Cys 


Phe 










125 






Asn 


Ser 


Ser 


Leu 


Glu 


Pro 


Ala 










140 






Ala 


Ile


Gln


Thr 


Ser 


Glu 


Gly 










155 






Gly 


Pro 


Ser 


Glu 


Gly 


Pro 


Gly 










170 






Ala 


Ala 


Phe 


Ala 


Gln


Leu 


Val 










185 






Gly 


Thr 


Trp 


Arg 


Lys 


Arg 


Phe 










200 







Leu 


Phe 


Leu 


Phe 


His 


Ser 


Cys 


Cys 






10 










15 


Arg 


Glu 


Gly Met 


Thr 


Leu 


Leu 


Lys 




25 










30 


Tyr 


Gln


Ser 


Val 


Ala 


Ala 


Cys 


Arg 




40 










45 


Ser 


Pro 


Ala 


Gly Val 


Ser 


Leu 


Pro 






55 










50 


Ala 


Glu Gly 


Ser 


Thr 


Gly Gln Gln






70 










75 


Thr 


Val 


His 


Asp 


Lys 


Arg 


Val 


Lys 






85 










90 


Ala 


Thr 


Leu 


Pro 


Thr 


Ser 


Ser 


Gln






100 










105 


Cys 


Ser 


Leu 


Phe 


Val 


Gln


Val 


Gly 




115 
Ile
Ile


Asn 


Val 


Thr 


Gly 
170 


Val 


Trp 


Glu 


Arg 


Asn 
175 


Val 


Thr 


Gly 


Arg 


Gly 
180 


Val 


Thr 


Val 


Val 


Val 
185 


Val 


Asp 


Asp 


Gly 


Val 
190 


Glu 
Thr


Ile


Gln
195 


Asp 


Ile


Ala 


Pro 


Asn 
200 


Tyr 


Ser 


Pro 


Glu 


Gly 
205 


Ser 


Tyr 


Asp 


Leu 


Asn 
210 


Ser 


Asn 


Asp 


Pro 


Asp 
215 


Pro 


Met 


Pro 


His 


Pro 
220 


Asp 


Val 


Glu 


Asn 


Gly 
225 


Asn 


His 


His 


Gly 


Thr 
230 


Arg 


Cys 


Ala 


Gly 


Glu 
235 


Ile


Ala 


Ala 


Val 


Pro 
240 


Asn 


Asn 


Ser 


Phe 


Cys 
245 


Ala 


Val 


Gly 


Val 


Ala 

250 


Tyr 


Gly 


Ser 


Arg 


Ile
Ala


Gly 


Ile


Arg 


Val 
260 


Leu 


Asp 


Gly 


Pro 


Leu 
265 


Thr 


Asp 


Ser 


Met 


Glu 
270 


Ala 


Val 


Ala 


Phe 


Asn 
275 


Lys 
Tyr


Gln


Ile
280 


Asn


Asp 


Ile


Tyr 


Ser 
285 


Cys 


Arg 


Tyr 


Leu 


Phe 
290 


Leu 


Asn 


Ser 


Trp 


Gly 
295 


Pro 


Asp 


Asp 


Asp 


Gly 
300 


Lys 


Thr 


Val 


Asp 


Gly 
305 


Pro 


His 


Gln


Leu 


Gly 
310 


Lys 


Ala 


Ala 


Leu 


Gln
315 


His 


Gly 


Val 


Ile


Ala 
320 


Gly 


Arg 


Gln


Gly 


Phe 
325 


Gly 


Ser 


Ile


Phe 


Val 
330 


Val 


Ala 


Ser 


Gly 


Asn 
335 


Gly 


Gly 


Gln


His 


Asn 
340 


Asp 


Asn 


Cys 


Asn 


Tyr 
345 


Asp 


Gly 


Tyr 


Ala 


Asn 
350 


Ser 


Ile


Tyr 


Thr 


Val 
355 


Thr 


Ile


Gly 


Ala 


Val 
360 


Asp 


Glu 


Glu 


Gly 


Arg 
365 


Met 


Pro 


Phe 


Tyr 


Ala 
370 


Glu 


Glu 


Cys 


Ala 


Ser 
375 


Met 


Leu 


Ala 


Val 


Thr 
380 


Phe 


Ser 


Gly 


Gly 


Asp 
385 


Lys 


Met 


Leu 


Arg 


Ser 
390 


Ile


Val 


Thr 


Thr 


Asp 


Trp 


Asp 


Leu 


Gln


Lys 


Gly Thr 


Gly 


Cys 


Thr 
400










405 


Glu 


Gly 


His 


Thr 


Gly 


Thr 


Ser 


Ala 


Ala 


Ala 


Pro 


Leu 


Ala 


Ala 


Gly 








410 










415 










420 


Met 


Ile


Ala 


Leu 


Met 
425 


Leu 


Gln


Val 


Arg 


Pro 
430 


Cys 


Leu 


Thr 


Trp 


Arg 
435 


Asp 


Val 


Gln


His 


Ile
440 


Ile


Val 


Phe 


Thr 


Ala 
445 


Thr 


Arg 


Tyr 


Glu 


Asp 
450 


Arg 


Arg 


Ala 


Glu 


Trp 
455 


Val 


Thr 


Asn 


Glu 


Ala 
460 


Gly 


Phe 


Ser 


His 


Ser 
465 


His 


Gln


His 


Gly 


Phe 


Gly 


Leu 


Leu 


Asn 


Ala 


Trp Arg 


Leu 


Val 


Asn 
Ala


Ala 
Ile


Trp 
485 


Thr 


Ser 


Val 


Pro 


Tyr 
490 


Leu 


Ala 


Ser 


Tyr 


Val 
495 


Ser 


Pro 


Val 


Leu 


Lys 
500 


Glu 


Asn 


Lys 


Ala 


Ile

505 


Pro 


Gln


Ser 


Pro 


Arg 
510 


Ser 


Leu 


Glu 


Val 


Leu 
515 


Trp 


Asn 


Val 


Ser 


Arg 
520 


Met 


Asp 


Leu 


Glu 


Met 
525 


Ser 


Gly 


Leu 


Lys 


Thr 


Leu 


Glu 


His 


Val 


Ala 


Val 


Thr 


Val 


Ser 


Ile








530 










535 










540 


Thr 


His 


Pro 


Arg 


Arg 
545 


Gly 


Ser 


Leu 


Glu 


Leu 
550 


Lys 


Leu 


Phe 


Cys 


Pro 
555 


Ser 


Gly 


Met 


Met 


Ser 
560 


Leu 


Ile


Gly 


Ala 


Pro 
565 


Arg 


Ser 


Met 


Asp 


Ser 
570 


Asp 


Pro 


Asn 


Gly 


Phe 
575 


Asn 


Asp 


Trp 


Thr 


Phe 

580 


Ser 


Thr 


Val 


Arg 


Cys 

585


Trp 


Gly 


Glu 


Arg 


Ala 
590 


Arg 


Gly 


Thr 


Tyr 


Arg 
595 


Leu 


Val 


Ile


Arg 


Asp 
600 


Val 


Gly 


Asp 


Glu 


Ser 


Phe 


Gln


Val 


Gly 


Ile


Leu 


Arg 


Gln


Trp


Gln
Leu


Thr 


Leu 


Tyr 


Gly 
620 


Ser 


Val 


Trp 


Ser 


Ala 
Val


Asp 


Ile


Arg 


Asp 
Gln
Leu


Leu 
Met


Ser 


Gly Lys 


Tyr 


Leu 


His 
635










640 










645 


Asp 


Asp 


Phe Ala 


Leu 


Pro 


Cys 


Pro 


Pro 


Gly Leu 


Lys 


Ile


Pro 


Glu 
655










660 


Glu 


Asp 


Gly Tyr 


Thr 


Ile


Thr 


Pro 


Asn 


Thr 


Leu 


Lys 


Thr 


Leu 


Val 




665 










670 










675 


Leu 


Val 


Gly Cys 


Phe 


Thr 


Val 


Phe 


Trp 


Thr 


Val 


Tyr 


Tyr Met 


Leu 






680 










685 










690 


Glu 


Val 


Tyr Leu 


Ser 


Gln


Arg 


Asn 


Val 


Ala 


Ser 


Asn 


Gln


Val 


Cys 
Arg


Ser 


Gly Pro 


Cys 


His 


Trp 


Pro 


His 


Arg 


Ser 


Arg 


Lys 


Ala 


Lys 
715
Glu


Glu 


Gly Thr 


Glu 


Leu 


Glu 


Ser 


Val 


Pro 


Leu 


Cys 


Ser 


Ser 


Lys 
Asp


Pro 


Asp Glu 


Val 


Glu 


Thr 


Glu 


Ser 


Arg Gly 


Pro 


Pro 


Thr 


Thr 
750


Ser 


Asp 


Leu Leu 


Ala 


Pro 


Asp 


Leu 


Leu 


Glu 


Gln


Gly Asp 


Trp 


Ser 
Leu


Ser 


Gln Asn


Lys 
770 


Ser 


Ala 


Leu 


Asp 


Cys 
775 


Pro 


His 


Gln
Leu
780
Val


Pro His 


Gly Lys 
Gln


Ile


Cys 
Arg
Gly
Glu


Arg 


Arg 
Gly
Glu
Ala
Glu
Ala
Pro
Asp
Ser


Asp 
Arg


Val 


Ala 


Asp 
Leu
Pro


Glu 
Ala


Phe 


Arg 
Glu


Ala 


Ala 


Glu 


Glu 


Ala 


Cys 


Arg Ser Ile Gly Thr Met Val


Glu 
Asn


Thr 
Val


Asp 


Leu 


Tyr 


Gln


Ser 


Leu 


Gln


Lys 


Leu 
Asp


Lys 
Val


Asp 
Asp


Pro 


Glu 
Arg
Val
Leu
Met
Ser Gly


Ile
Lys


Arg 


Ala 


Val 
Val
Ala


Gly Asp 
Ser


Pro 


Asp 
Leu


Phe 
Tyr


Pro 
Ala
Leu


Ser 


Ser 


Arg 


Asp 
290 


Leu 
Phe


Ser 
Arg


Ala 
305 
Gln


Thr 


Val 


Met 
Phe
320 


Leu 


Glu 


Arg 
Lys


Asp 
Glu
Gln
Val
Val
Glu
Ala
Ser


Leu 






400 










405 


Ser 


Ser 


Cys 


Ala 


Gly Leu 


Phe 


Ile






415 










420 


Trp 


Pro 


Gly 


Val 


Trp 


Val 


His 


Leu 






430 










435 


Ala 


Gly Glu 


Arg Ala 


Thr 


Gly 


Phe 






445 










450 


Leu 


Phe 


Gly 


Arg 


Ala 


Ser 


Glu 


Asp 






460 










465 


Pro 


Leu 


Gly 


Cys 


Glu 


Val 


Asp 


Val 






475 










480 


Asp 


Ser 


Lys 


Arg 


Arg 


Arg 


Leu 


Val 






490 










495 
Met


Ala 


Ala 


Ser 


Thr 


Gly 


Tyr 


Val 


Arg 


Leu 


Trp 


Gly Ala Ala 


Arg 


1 








5 










10 








15 


Cys 


Trp 


Val 


Leu 


Arg 


Arg 


Pro 


Met 


Leu 


Ala 


Ala Ala Gly Gly 


Arg 






20 










25 








30 


Val 


Pro 


Thr 


Ala 


Ala 


Gly 


Ala 


Trp 


Leu 


Leu 


Arg 


Gly Gln


Arg 


Thr 










35 










40 








45 


Cys 


Asp 


Ala 


Ser 


Pro 


Pro 


Trp


Ala 


Leu 


Trp 


Gly Arg Gly 


Pro 


Ala 










50 










55 








60 


Ile


Gly 


Gly 


Gln


Trp 


Arg 


Gly 


Phe 


Trp 


Glu 


Ala 


Ser Ser 


Arg 


Gly 










65 










70 








75 


Gly 


Gly 


Ala 


Phe 


Ser 


Gly 


Gly 


Glu 


Asp 


Ala 


Ser 


Glu Gly 


Gly 


Ala 






80 










85 








90 


Glu 


Glu 


Gly 


Ala 


Gly 


Gly 


Ala 


Gly Gly 


Ser 


Ala 


Gly Ala 


Gly 


Glu 








95 










100 








105 


Gly 


Pro 


Val 


Ile


Thr 


Ala 


Leu 


Thr 


Pro 


Met 


Thr 


Ile Pro


Asp 


Val 








110 










115 








120 


Phe 


Pro 


His 


Leu 


Pro 


Leu 


Ile


Ala 


Ile


Thr 


Arg 


Asn Pro 


Val 


Phe 










125 










130 








135 


Pro 


Arg 


Phe 


Ile


Lys 


Ile


Ile


Glu 


Val 


Lys 


Asn 


Lys Lys 


Leu 


Val 



140 145 150 
Glu


Leu 


Leu 


Arg 


Arg Lys 


Val 


Arg 


Leu 


Ala 


Gln


Pro 


Tyr Val 


Gly 










155 








160 








165 


Val 


Phe 


Leu 


Lys 


Arg Asp 


Asp 


Ser 


Asn 


Glu 


Ser 


Asp 


Val Val 


Glu 










170 








175 








180 


Ser 


Leu 


Asp 


Glu 


Ile Tyr


His 


Pro Gly Thr Ala Gly Gly Gly Cys 








185 








190 








195 


Gln


Ala 


Leu 


Ser 


Gly Pro 


Glu 


Leu 


Ser 


Ala 


Val 


Gly 


Ser Ala 


Pro 










200 








205 








210 


Gly 


Pro 


Gly 


Ser 


Gly Ala 


Thr 


Glu 


Arg 


Ala 


Ala 


Arg 


Ser Ser 


Asp 






215 








220 








225 


Pro 


Asp 


Pro 


Arg 


Asp Leu 


Ser 


Arg 


Leu 


Asn 


Gln


Ser 


Val Ala 
Ala


Thr 


Val 


Gly Glu 




Ser 


Cys 


Val 


Arg 


Cys 


Thr 


Phe 


Leu 










5 










10 










15 


Asn 


Jr ±. <J 




JL_y 


Gln


Arg 


Gln


Cys 


Ser 


Ile


Cys 


Glu 


Ala 


Pro 


Arg 










20 










25 










30 


His 


Lys 


Pro 


Asp 


Leu 


Asn 


His 


Ile


Leu 


Arg 


Leu 


Ser 


Val 


Glu 


Glu 






35 










40 










45 


Gln


Lys 




Pro 


Cys 


Ala 


Arg 


Cys 


Thr 


Phe 


Arg 


Asn 


Phe 


Leu 


Gly 










50 










55 










60 


Lys 


Glu 


Ala 


Cys 


Glu 


Val 


Cys 


Gly 


Phe 


Thr 


Pro 


Glu 


Pro 


Ala 


Pro 






65 










70 










75 


Gly 


Ala 


Ala 


Phe 


Leu 


Pro 


Val 


Leu 


Asn 


Gly Val 


Leu 


Pro 


Lys 


Pro 








80 










85 










90 


Pro 


Ala 


Ile


Leu 


Gly Glu 


Pro 


Lys 


Gly 


Ser 


Cys 


Gln


Glu 


Glu 


Ala 










95 










100 










105 


Gly 


Pro 


Val 


Arg 


Thr 


Ala 


Gly 


Leu 


Val 


Ala 


Thr 


Glu 


Pro 


Ala 


Arg 








110 










115 










120 


Gly 


Gln


Cys 


Glu 


Asp 


Lys 


Asp 


Glu 


Glu 


Glu 


Lys 


Glu 


Glu 


Gln


Glu 








125 










130 










135 


Glu 


Glu 


Glu 


Gly 


Ala 


Ala 


Glu 


Pro 


Arg 


Gly Gly Trp Ala 


Cys 


Pro 










140 










145 










150 


Arg 


Cys 


Thr 


Leu 


His 


Asn 


Thr 


Pro 


Val 


Ala 


Ser 


Ser 


Cys 


Ser 


Val 






155 










160 










165 


Cys 


Gly 


Gly 


Pro 


Arg 


Arg 


Leu 


Ser 


Leu 


Pro 


Arg 


Ile


Pro 


Pro 


Glu 








170 










175 










180 


Ala 


Leu 


Val 


Val 


Pro 


Glu 


Val 


Val 


Ala 


Pro 


Ala 


Gly 


Phe 


His 


Val 










185 










190 










195 


Val 


Pro 


Ala 


Ala 


Pro 


Pro 


Pro 


Gly Leu 


Pro Gly Glu 


Gly Ala 


Glu 










200 










205 










210 


Ala 


Asn 


Pro 


Pro 


Ala 


Thr 


Ser 


Gln


Gly 


Pro 


Ala 


Ala 


Glu 


Pro 


Glu 










215 










220 










225 


Pro 


Pro 


Arg 


Val 


Pro 


Pro 


Phe 


Ser 


Pro 


Phe 


Ser 


Ser 


Thr 


Leu 


Gln








230 










235 










240 


Asn 


Asn 


Pro 


Val 


Pro 


Arg 


Ser 


Arg 


Arg 


Glu 


Val 


Pro 


Pro 


Gln


Leu 










245 










250 










255 


Gln


Pro 


Pro 


Val 


Pro 


Glu 


Ala 


Ala 


Gln


Pro 


Ser 


Pro 


Ser 


Ala 


Gly 










260 










265 










270 


Cys 


Arg 


Gly 


Ala 


Pro 


Gln


Gly 


Ser Gly Trp Ala 


Gly 


Ala 


Ser 


Arg 






275 










280 










285 


Leu 


Ala 


Glu 


Leu 


Leu 


Ser 


Gly 


Lys 


Arg 


Leu 


Ser 


Val 


Leu 


Glu 


Glu 



290 295 300 
Glu Ala Thr Glu Gly Gly Thr


Ser Arg 


Val 


Glu Ala Gly 


Ser 


Ser 










305 








310 










315 


Thr 


Ser 


Gly Ser Asp Ile Ile Asp Leu Ala Gly Asp Thr Val Arg










320 








325 










330 


Tyr 


Thr 


Pro 


Ala 


Ser 


Pro 


Ser 


Ser Pro 


Asp 


Phe 


Thr 


Thr 


Trp 


Ser 








335 








340 










345 


Cys 


Ala 


Lys 


Cys 


Thr 


Leu 


Arg 


Asn Pro 


Thr 


Val 


Ala 


Pro 


Arg 


Cys 






350 








355 










360 


Ser 


Ala 


Cys 


Gly 


Cys 


Ser 


Lys 


Leu His 


Gly 


Phe 


Gln


Glu 


His 


Gly 








365 








370 










375 


Glu 


Pro 


Pro 


Thr 


His 


Cys 


Pro 


Asp Cys 


Gly Ala Asp 


Lys 


Pro 


Ser 










380 








385 










390 


Pro 


Cys


Gly Arg 


Ser 


Cys 


Gly Arg Val 


Ser 


Ser 


Ala 


Gln


Lys 


Ala 








395 








400 










405 


Ala 


Arg 


Val 


Leu 


Pro 


Glu 


Arg 


Pro Gly Gln Trp


Ala 


Cys 


Pro 


Ala 








410 








415 










420 


Cys 


Thr 


Leu 


Leu 


Asn 


Ala 


Leu 


Arg Ala 


Lys 


His 


Cys 


Ala 


Ala 


Cys 








425 








430 










435 


His 


Thr 


Pro 


Gln


Leu 


Leu 


Val 


Ala Gln Arg Arg


Gly Ala Ala 


Pro 










440 








445 










450 


Leu 


Arg 


Arg 


Arg 


Glu 


Ser 


Met 


His Val 


Glu 


Gln


Arg 


Arg 


Gln


Thr 








455 








460 










465 


Asp 


Glu 


Gly 


Glu 


Ala 


Lys 


Ala 


Leu Trp


Glu 


Asn 


Ile


Val 


Ala 


Phe 








470 








475 










480 


Cys 


Arg 


Glu 


Asn 


Asn 


Val 


Ser 


Phe Val 


Asp 


Asp 


Ser 


Phe 


Pro 


Pro 






485 








490 










495 


Gly Pro 


Glu 


Ser 


Val 


Gly 


Phe 


Pro Ala 


Gly Asp 


Ser 


Val 


Gln


Gln










500 






505 










510 


Arg Val 


Arg 


Gln


Trp 


Leu 


Arg 


Pro Gln


Glu 


Ile


Asn 


Cys 


Ser 


Val 








515 








520 










525 


Phe Arg Asp His 


Arg 


Ala 


Thr 


Trp Ser 


Val 


Phe 


His 


Thr 


Leu 


Arg 










530 








535 










540 


Pro 


Ser 


Asp 


Ile


Leu Gln Gly Leu Leu Gly Asn Cys


Trp 


Phe 


Leu 








545 








550 










555 


Ser 


Ala 


Leu 


Ala 


Val 
560 


Leu 


Ala 


Glu Arg 


Pro 
565 


Asp 


Leu 


Val 


Glu 


Arg 
570 


Val 


Met 


Val 


Thr 


Arg 


Ser 


Leu 


Cys Ala 


Glu 


Gly Ala Tyr 


Gln


Val 










575 








580 










585 


Arg 


Leu 


Cys 


Lys 


Asp 


Gly 


Thr 


Trp Thr 


Thr 


Val 


Leu 


Val 


Asp 


Asp 








590 








595 










600 


Met 


Leu 


Pro 


Cys 


Asp 


Glu 


Ala 


Gly Cys 


Leu 


Leu 


Phe 


Ser 


Gln


Ala 








605 








610 










615 


Gln


Arg 


Lys 


Gln


Leu 


Trp 


Val 


Ala Leu 


Ile


Glu 


Lys 


Ala 


Leu 


Ala 






620 








625 










630 


Lys


Leu 


His 


Gly 


Ser 


Tyr 


Phe 


Ala Leu 


Gln


Ala 


Gly Arg 


Ala 


Ile






635 








640 










645 


Glu 


Gly 


Leu 


Ala 


Thr 


Leu 


Thr 


Gly Ala 


Pro 


Cys 


Glu 


Ser 


Leu 


Ala 








650 








655 










660 


Leu 


Gln


Leu 


Ser 


Ser 
665 


Thr 


Asn 


Pro Arg 


Glu 
670 


Glu 


Pro 


Val 


Asp 


Thr 
675 


Asp 


Leu 


Ile


Trp 


Ala 


Lys 


Met 


Leu Ser 


Ser 


Lys 


Glu 


Ala 


Gly 


Phe 






680 








685 










690 


Leu 


Met 


Gly Ala 


Ser 


Cys 


Gly Gly Gly Asn 


Met 


Lys 


Val 


Asp 


Asp 










695 






700 










705 


Ser 


Ala 


Tyr 


Glu 


Ser 


Leu 


Gly Leu Arg 


Pro 


Arg 


His 


Ala 


Tyr 


Ser 








710 








715 










720 


Ile


Leu 


Asp 


Val 


Arg 


Asp 


Val 


Gln Gly Thr Arg


Leu 


Leu 


Arg 


Leu 








725 






730 










735 


Arg 


Asn 


Pro 


Trp 


Gly Arg 


Phe 


Ser Trp 


Asn 


Gly 


Ser 


Trp 


Ser 


Asp 






740 








745 










750 


Glu 


Trp 


Pro 


His 


Trp 


Pro 


Gly His Leu 


Arg 


Gly 


Glu 


Leu 


Met 


Pro 








755 








760 










765 


His 


Gly 


Ser 


Ser 


Glu 


Gly Val 


Phe Trp 


Met 


Glu 


Tyr 


Gly Asp 


Phe 
770



Val 


Arg 


Tyr 


Phe Asp 
785 


Ser 


Val 


Trp 


Gln


Glu 


Ala Arg 


Val 


Gln






800 






Ala 


Pro 


Val 


Gly Val 
815 


Thr 


Ala 


Leu 


Glu 


Phe 


Ala Leu 
830 


Phe 


Gln


Val 


Asp 


Ser 


His Leu 
845 


Leu 


Asp 


Thr 


Phe 


Gly 


Ser Gly Gly 


His 








860 






His 


Ser 


Lys 


Arg Ala 
875 


Val 


Lys 


Leu 


Glu 


Pro 


Gly Glu 
890 


Tyr 


Ala 


Trp 


Gly 


Pro 


Pro Leu 
905 


Pro 


Gly 


Pro 


Leu 


Pro 


Gln Ala
920 


His 


Ala 


Phe 


Thr 


Arg 


Pro Ser 


Leu 





935 







775 










Asp 


Ile


Cys 


Lys Val 


xlXS 


Ser 


Asp 






790 








795


Gly 


Cys 


Phe 


Pro Ser 


Ser 


J\±ci 


Ser 


805 








810


Leu 


Thr 


Val 


Leu Glu 


Arg 


Ala


Ser 






820 










Glu 


Gly 


Ser Arg Arg 


Ser 


Asp 






835 










Leu 


Cys 


Ile


Leu Val 


Phe 


Arg 


TV T -a 

Ala




850 








855 


Leu 


Ser 


Leu 


Gly Arg 


Leu 


Leu 


Ala 






865 








870 


Lys 


Phe 


Val 


Ser Cys 


Asp 


Val 


Met 




880 








885 


Val 


Val 


Cys 


Cys Ala 


Phe 


Asn 


His 






895 








900 


Thr 


Pro 


Ala 


Pro Gln Gly Thr Trp






910 








915 


Pro 


Gly Pro Arg Gly Gly Leu Arg 






925 








930 
Ala


Met 


Ala 


Ala 


Val 


Arg 


Gly Ala 


Pro 


Leu 


Leu 


Ser 


Cys 


Leu 


Leu 


1 








5 










10 










15 


Leu 


Leu 


Ala 


Leu 


Cys 


Pro 


Gly 


Gly Arg 


Pro 


Gln


Thr 


Val 


Leu 


Thr 










20 










25 










30 


Asp 


Asp 


Glu 


Ile


Glu 


Glu 


Phe 


Leu 


Glu Gly Phe 


Leu 


Ser 


Glu 


Leu 






35 










40 










45 


Glu 


Pro 


Glu 


Pro 


Arg 
50 


Glu 


Asp 


Asp 


Val 


Glu 
55 


Ala 


Pro 


Pro 


Pro 


Pro 
60 


Glu 


Pro 


Thr 


Pro 


Arg 


Val 


Arg 


Lys 


Ala Gln Ala Gly Gly Lys


Pro 










65 










70 










75 


Gly 


Lys 


Arg 


Pro 


Gly 


Thr 


Ala 


Ala 


Glu 


Val 


Pro 


Pro 


Glu 


Lys 


Thr 




80 










85 










90 


Lys 


Asp 


Lys 


Gly 


Lys 


Lys 


Gly 


Lys 


Lys 


Asp 


Lys 


Gly 


Pro 


Lys 


Val 




95 










100 










105 


Pro 


Lys 


Glu 


Ser 


Leu 


Glu 


Gly 


Ser 


Pro 


Arg 


Pro 


Pro 


Lys 


Lys 


Gly 








110 










115 










120 


Lys 


Glu 


Lys 


Pro 


Pro 


Lys 


Ala 


Thr 


Lys 


Lys 


Pro 


Lys 


Glu 


Lys 


Pro 






125 










130 










135 


Pro 


Lys 


Ala 


Thr 


Lys 


Lys 


Pro 


Lys 


Glu 


Lys 


Pro 


Pro 


Lys 


Ala 


Thr 








140 










145 










150 


Lys 


Lys 


Pro 


Lys 


Glu 


Lys 


Pro 


Pro 


Lys 


Ala 


Thr 


Lys 


Lys 


Pro 


Pro 




155 










160 










165 


Ser 


Gly Lys 


Arg 


Pro 


Pro 


Ile


Leu 


Ala 


Pro 


Ser 


Glu 


Thr 


Leu 


Glu 








170 










175 










180 


Trp 


Pro 


Leu 


Pro 


Pro 


Pro 


Pro 


Ser 


Pro 


Gly 


Pro 


Glu 


Glu 


Leu 


Pro 








185 










190 










195 


Gln Glu Gly Gly Ala


Pro 


Leu 


Ser 


Asn 


Asn 


Trp 


Gln


Asn 


Pro 


Gly 










200 










205 










210 


Glu 


Glu 


Thr 


His 


Val 


Glu 


Ala 


Arg 


Glu 


His 


Gln


Pro 


Glu 


Pro 


Glu 
Glu Glu Thr
Arg Glu Asp
Pro Arg Pro
Pro Glu Thr
Arg Ile Glu
Asp Tyr Gly
Tyr Tyr Phe
Gln Thr Asp
Asp Ser Ser
Gly Lys Asp
Glu Trp Thr
Glu Ser His
Leu Arg His
Thr Gly Ala
Glu Asp Asp
Thr Thr Arg
Ile His Asp
Asp Ser Gln
Thr Phe His
Leu Pro Glu
Thr Trp Asn
Ser Val Ala
Ala Thr Asp
Arg Gln Leu
Arg Thr Tyr
Ala Met Glu
Pro Glu Phe
Gly Arg Glu
Tyr Arg Asp
Arg Ile His
Ala Ala Gln





215 




Glu 


Gln


Pro 




230 




Tyr 


Glu 


Asp 




245 




Pro 


Pro 


Ser 




260 




Pro 


Glu 


Glu 




275 




Pro 


Pro 


Val 




290 




Asp 


Gly 


Tyr 




305 




Gly 


Pro 


Pro 




320 




Glu 


Glu 


Lys 




335 




Pro 


Lys 


Glu 




350 




His 


Lys 


Glu 




365 




Pro 


Thr 


Glu 




380 




Arg 


Ile


Glu 




395 




Gly 


Leu 


Gly 




410 




Thr 


Glu 


Asp 




425 




Ala 


Arg 


Thr 




440 




Phe 


Thr 


Gly 




455 




Asp 


Phe 


Val 




470 




Thr 


Trp 


Val 




485 




Gly 


Asn 


Val 




500 




Pro 


Val 


Val 




515 




Gly 


Ser 


Leu 




530 




Pro 


Val 


Tyr 




545 




Asp 


Leu 


Asp 




560 




Met 


Lys 


Val 




575 




Ser 


Leu 


Gly 




590 




Ile


Ser 


Asp 




605 




Arg 


Tyr 


Thr 




620 




Leu 


Leu 


Leu 




635 




Gly 


Asn 


Pro 




650 




Leu 


Val 


Pro 




665 




Met 


Gly 


Ser 




680 





mr 


Leu 


Asp 


Phe


QsXU 


Tyr 


Arg 


Arg 


Arg 


Lys 


TV 1 Hi 

Ala 


Pro 


Lys 


Pro 


Leu 


Val


Ile


Pro 


Pro 


Pro 


Gln


Glu 


Glu 


Leu 


Glu 


Thr 


Asp 


Pro 


Arg 


Lys 


Lys 


Val 


Lys 


Asp 


Asn 


Gln


Ala 


Gln


Arg 


Asp 


Tyr 


Tyr 


Gln


Trp 


-r ~\ — 

Ile


Val 


Ile


Thr 


Thr 


Thr 


Phe 


Met 


Tyr 


Thr 


Asp 


Lys 


Asp 


Ala 


Arg 


Phe 


Cys 


Met 


Arg 


Ser 


Tyr 


Tyr 


Phe 


Arg 


His 


Val 


Asn 


Glu 


Lys 


Ser 


Ser 


Asn 


Pro 


Gly 


Ala 


Gly 


Ile


Leu 


Leu 


Met 


Arg 


Val 


Arg 


Ser 


Leu 


Asn 


Glu 


Phe 


Gly 



220 






Tyr 


Asn 


Asp 


235 






Ile


Arg 


Arg 


250 






Arg 


Pro 


Glu 


265 






Ala 


Pro 


Ala 


280 






Leu 


Pro 


Pro 


295 






Asn 


Tyr 


Asp 


310 






Lys 


Pro 


Asp 


325 






Lys 


Lys 


Pro 


340 






Lys 


Trp 


Ala 


355 






Gly 


Glu 


Glu 


370 






Cys 


Pro 


Pro 


385 






Ile


Arg 


Ala 


400 






Gly 


Arg 


Leu 


415 






Asp 


Gly 


Ala 


430 






Glu 


Val 


Asp 


445 






Gln


Gly 


Arg 


460 






Phe 


Val 


Gly 


475 






Asn 


Gly 


Tyr 


490 






Thr 


Pro 


Val 


505 






Ile


Arg 


Ile 


520 






Leu 


Glu 


Val 


535 






Ala 


Gln


Asn 


550 






His 


Ser 


Tyr 


565 






Glu 


Cys 


Pro 


580 






Arg 


Gly 


Leu 


595 






Glu 


His 


Glu 


610 






His 


Gly 


Asn 


625 






Gln


Tyr 


Leu 


640 






Ser 


Leu 


Val 


655 






Pro 


Asp 


Gly 


670 






Asn 


Trp 


Ala 


685 











225 


Gln


Ile


Glu 






240 


Gln


Lys 


Gln






255 


Arg 


Val 


Trp 






270 


Pro 


Glu 


Glu 






285 


Leu 


Pro 


Pro 






300 


Asp 


Met 


Asp 






315 


Ala 


Glu 


Arg 






330 


Lys 


Lys 


Glu 






345 


Val 


Glu 


Lys 






360 


Leu 


Glu 


Glu 






375 


Ile


Gly 


Met 






390 


Ser 


Ser 


Met 






405 


Asn 


Met 


Gln






420 


Trp 


Cys 


Ala 






435 


Thr 


Arg 


Arg 






450 


Asp 


Ser 


Ser 






465 


Phe 


Ser 


Asn 






480 


Glu 


Glu 


Met 






495 


Leu 


Ser 


Glu 






510 


Tyr 


Pro 


Leu 






525 


Leu 


Gly 


Cys 






540 


Glu 


Val 


Val 






555 


Lys 


Asp 


Met 






570 


Thr 


Ile 


Thr 






585 


Lys 


Ile


Tyr 






600 


Leu 


Gly 


Glu 






615 


Glu 


Val 


Leu 






630 


Cys 


Arg 


Glu 






645 


Gln


Asp 


Thr 






660 


Tyr 


Glu 


Val 






675 


Leu 


Gly 


Leu 






690 
Trp


Thr 


Glu 


Glu 


Gly 


Phe 


Asp 


Ile


Phe Glu 


Asp 


Phe 


Pro 


Asp Leu 








695 








700 








705 


Asn 


Ser 


Val 


Leu 


Trp 


Gly Ala Glu Glu Arg Lys 


Trp 


Val 


Pro Tyr 










710 








715 








720 


Arg 


Val 


Pro 


Asn 


Asn 


Asn 


Leu 


Pro 


Ile Pro


Glu 


Arg 


Tyr 


Leu Ser 








725 








730 








735 


Pro 


Asp 


Ala 


Thr 


Val 


Ser 


Thr 


Glu 


Val Arg 


Ala 


Ile


Ile


Ala Trp 








740 








745 








750 


Met 


Glu 


Lys 


Asn 


Pro 


Phe 


Val 


Leu 


Gly Ala 


Asn 


Leu 


Asn 


Gly Gly 








755 








760 








765 


Glu 


Arg 


Leu 


Val 


Ser 
770 




Pro 


Tyr 


Asp Met 
775 


Ala 


Arg 


Thr 


Pro Thr 
780 


Gln


Glu 


Gln


Leu 


Leu 
785 


Ala 


Ala 


Ala 


Met Ala 
790 


Ala 


Ala 


Arg 


Gly Glu 
795 


Asp 


Glu 


Asp 


Glu 


Val 


Ser 


Glu 


Ala 


Gln Glu


Thr 


Pro 


Asp 


His Ala 






800 








805 








810 


Ile


Phe 


Arg 


Trp 


Leu 




Ile


Ser 


Phe Ala 


Ser 


Ala 


His 


Leu Thr 






815 








820 








825 


Leu 


Thr 


Glu 


Pro 


Tyr




Gly Gly Cys Gln


Ala 


Gln


Asp 


Tyr Thr 










830 








835 








840 


Gly Gly Met 


Gly


Ile


Val 


Asn Gly Ala Lys 


Trp 


Asn 


Pro 


Arg Thr 










845 








850








855 


Gly Thr 


Ile


Asn 


Asp 


Phe 


Ser 


Tyr 


Leu His


Thr 


Asn 


Cys 


Leu Glu 










860 








865








870 


Leu 


Ser 


Phe 


Tyr 


Leu 


Gly 


Cys 


Asp 


Lys Phe


Pro 


His 


Glu 


Ser Glu 








875 








880 








885 


Leu 


Pro 


Arg 


Glu 


Trp 


Glu 


Asn 


Asn 


Lys Glu 


Ala 


Leu 


Leu 


Thr Phe 








890 








895 








900 


Met 


Glu 


Gln


Val 


His 


Arg 


Gly 


Ile


Asn His 


Gly Val 


Lys 


Thr Ala 










905 








910 








915 


Ser Gly 


Gly Asp 


Tyr


J. j-t' 


Arg 


Ile


Leu Asn 


Pro 


Gly 


Glu 


Tyr Arg 










920 








925 








930 


Val 


Thr 


Ala 


His 


Ala 


Glu 


Gly Tyr 


Thr Pro 


Ser 


Ala 


Lys 


Thr Cys 










935 








940 








945 


Asn 


Val 


Asp 


Tyr 


Asp 


X. X w 


Gly Ala 


Thr Gln


Cys 


Asn 


Phe 


Ile Leu








950 








955 








960 


Ala 


Arg 


Ser 


Asn 


Trp 
965 


Lys 


Arg 


Ile


Arg Glu 
970 


Ile


Met 


Ala 


Met Asn 

975 


Gly Asn 


Arg 


Pro 


Ile


ST JL U 


His 


Ile


Asp Pro 


Ser 


Arg 


Pro 


Met Thr 








980 








985 








990 


Pro 


Gln


Gln


Arg 


Arg 
995 


Leu 


Gln


Gln


Arg Arg 
1000 


Leu 


Gln


His 


Arg Leu 
1005 


Arg 


Leu 


Arg 


Ala 


Gln


Met 


Arg 


Leu 


Arg Arg 


Leu 


Asn 


Ala 


Thr Thr 






1010 








1015 








1020 


Thr 


Leu 


Gly 


Pro 


His 


Thr 


Val 


Pro 


Pro Thr 


Leu 


Pro 


Pro 


Ala Pro 






1025 








1030 








1035 


Ala 


Thr 


Thr 


Leu 


Ser 


Thr 


Thr 


Ile


Glu Pro 


Trp 


Gly 


Leu 


Ile Pro








1040 








1045 








1050 


Pro 


Thr 


Thr 


Ala 


Gly 


Trp 


Glu 


Glu 


Ser Glu 


Thr 


Glu 


Thr 


Tyr Thr 








1055 








1060 








1065 


Glu 


Val 


Val 


Thr 


Glu 


Phe 


Gly Thr 


Glu Val 


Glu 


Pro 


Glu 


Phe Gly 










1070 








1075 








1080 


Thr 


Lys 


Val 


Glu 


Pro 


Glu 


Phe 


Glu 


Thr Gln


Leu 


Glu 


Pro 


Glu Phe 








1085 








1090 








1095 


Glu 


Thr 


Gln


Leu 


Glu 
1100 


Pro 


Glu 


Phe 


Glu Glu 
1105 


Glu 


Glu 


Glu 


Glu Glu 
1110 


Lys 


Glu 


Glu 


Glu 


Ile


Ala 


Thr 


Gly Gln Ala


Phe 


Pro 


Phe 


Thr Thr 










1115 








1120 








1125 


Val 


Glu 


Thr 


Tyr 


Thr 
1130 


Val 


Asn 


Phe 


Gly Asp 
1135 


Phe 
<400> 44



Met Ala Arg Gly Tyr 


Gly Ala 


Thr 


Val 


Ser 


Leu 


Val 


Leu 


Leu 


Gly 


1 




5 










10 










15 


Leu Gly Leu 


Ala 


Leu 


Ala


Val 


Ile


Val 


Leu 


Ala 


Val 


Val 


Leu 


Ser 




20 










25 










30 


Arg His Gln


Ala 


Pro 


Cys 


Gly 


Pro 


Gln


Ala 


Phe 


Ala 


His 


Ala 


Ala 




35 










40 










45 


Val Ala Ala 


Asp 


Ser 


Lys 


Val 


Cys 


Ser 


Asp 


Ile


Gly Arg Ala 


Ile




50 








55 










60 


Leu Gln Gln Gln Gly


Ser 


Pro 


Val 


Asp 


Ala 


Thr 


Ile


Ala 


Ala 


Leu 






65 










70 










75 


Val Cys Thr 


Ser 


Val 


Val 


Asn 


Pro 


Gln


Ser 


Met 


Gly 


Leu Gly Gly 




80 










85 










90 


Gly Val Ile


Phe 


Thr 


Ile


Tyr 


Asn 


Val 


Thr 


Thr Gly Lys 


Val 


Glu 




95 










100 










105 


Val Ile Asn


Ala 


Arg 
110 


Glu 


Thr 


Val 


Pro 


Ala 
115 


Ser 


His 


Ala 


Pro 


Ser 
120 


Leu Leu Asp 


Gln


Cys 


Ala 


Gln


Ala 


Leu 


Pro 


Leu 


Gly Thr 


Gly Ala 




125 










130 










135 


Gln Trp Ile Gly Val


Pro 


Gly Glu 


Leu Arg Gly Tyr 


Ala 


Glu 


Ala 






140 










145 










150 


His Arg Arg 


His 


Gly Arg Leu 


Pro 


Trp 


Ala 


Gln


Leu 


Phe 


Gln


Pro 




155 










160 










165 


Thr Ile Ala


Leu 


Leu Arg Gly Gly His Val Val 


Ala 


Pro 


Val 


Leu 






170 










175 










180 


Ser Arg Phe 


Leu 


His 


Asn 


Ser 


Ile


Leu 


Arg 


Pro 


Ser 


Leu 


Gln


Ala 




185 










190 










195 


Ser Thr Leu 


Arg 


Gln


Leu 


Phe 


Phe Asn Gly Thr Glu 


Pro 


Leu 


Arg 




200 










205 










210 


Pro Gln Asp


Pro 


Leu 


Pro 


Trp 


Pro 


Ala 


Leu 


Ala 


Thr 


Thr 


Leu 


Glu 




215 










220 










225 


Thr Val Ala 


Thr 


Glu 


Gly Val 


Glu 


Val 


Phe 


Tyr 


Thr 


Gly Arg 


Leu 






230 










235 










240 


Gly Gln Met


Leu 


Val 


Glu 


Asp 


Ile


Ala 


Lys 


Glu 


Gly 


Ser 


Gln


Leu 




245 










250 










255 


Thr Leu Gln


Asp 


Leu 


Ala 


Lys 


Phe 


Gln


Pro 


Glu 


Val 


Val 


Asp 


Ala 




260 










265 










270 


Leu Glu Val 


Pro 


Leu 


Gly Asp Tyr 


Thr 


Leu 


Tyr 


Ser 


Pro 


Pro 


Pro 






275 










280 










285 


Pro Ala Gly Gly Ala 


Ile


Leu 


Ser 


Phe 


Ile


Leu 


Asn 


Val 


Leu 


Arg 






290 










295 










300 


Gly Phe Asn 


Phe 


Ser 


Thr 


Glu 


Ser 


Met 


Ala Arg 


Pro 


Glu 


Gly Arg 




305 










310 










315 


Val Asn Val 


Tyr 


His 


His 


Leu Val 


Glu 


Thr 


Leu 


Lys 


Phe 


Ala 


Arg 




320 










325 










330 


Gly Gln Arg Trp Arg


Leu 


Gly Asp 


Pro 


Arg 


Ser 


His 


Pro 


Lys 


Leu 






335 










340 










345 


Gln Asn Ala


Ser 


Arg 


Asp 


Leu Leu Gly Glu Thr 


Leu 


Ala 


Gln


Leu 






350 










355 










360 


Ile Arg Gln


Gln


Ile


Asp Gly Arg Gly Asp His 


Gln


Leu 


Ser 


His 




365 










370 










375 


Tyr Ser Leu 


Ala Glu Ala Trp Gly His Gly Thr Gly Thr 


Ser 


His 




380 










385 










390 


Val Ser Val 


Leu 


Gly 
395 


Glu 


Asp 


Gly 


Ser 


Ala 
400 


Val 


Ala 


Ala 


Thr 


Ser 
405 


Thr Ile Asn


Thr 


Pro 


Phe 


Gly Ala Met Val 


Tyr 


Ser 


Pro 


Arg 


Thr 
410






Gly 


Ile


Ile


Leu 


Asn 


Asn 


Glu 








425 






Pro 


Arg 


Gly 


Ser 


Gly Thr 


Thr 










440 






Val 


Gly 


Gly Ala 


Pro 


Gly Arg 










455 






Arg 


Ser 


Pro 


Ser 


Ser 
470 


Met 


Val 


Gln


Gly 


Ser 


Lys 


Leu 
485 


Val 


Ile


Ile


Ser 


Ala 


Val 


Ala 
500 


Gln


Ala 


Phe 


Asp 


Leu 


Arg 


Ala 
515 


Ala 


Ile


Ser 


Lys 


Gly Cys 


Val 


Glu 


Tyr 










530 






Asp 


Leu 


Ala 


Ala 


Leu 
545 


Pro 


Arg 


Val 


Ile


Leu 


Leu 


Pro 
560 


Gln


Pro 


Val 


Gln


Arg 


Gly 


Leu 

575 


Gln


Asp 


Pro 


Phe 


Phe 


Leu 


Asn 
590 


Val 


Val 


Cys 


Val 


Tyr 


Ala 


Val 
605 


Ser 


Asp 


Gly Tyr 

















415 








420 


Leu 


Leu 


Asp 


Leu 


Cys 


Glu 


Arg Cys 






430 








435 


Pro 


Ser 


Pro 


Val 


Ser 


Gly Asp Arg 






445 








450 


Cys 


Trp 


Pro 


Pro 


Val 


Pro 


Gly Glu 






460 








465 


Pro 


Ser 


Ile


Leu 


Ile


Asn 


Lys Ala 






475 








480 


Gly Gly Ala Gly Gly Glu Leu Ile






490 








495 


Ile


Met 


Ser 


Lys 


Leu 


Trp 


Leu Gly 






505 








510 


Ala 


Ala 


Pro 


Ile


Leu 


His 


Val Asn 






520 








525 


Glu 


Pro 


Asn 


Phe 


Ser 


Gln


Lys Gln






535 








540 


Leu 


Val 


Ser 


Asn 


Ser 


Trp 


Pro Gln






550 








555 


Pro 


Lys 


Leu 


Leu 


Ala 


Leu 


Gln Glu




565 








570 


Arg Gly Gln Asn Gln Thr Gln Arg






580 








585 


Gln


Ala 


Val 


Ser 


Gln


Glu 


Gly Ala 






595 








600 


Leu 


Arg 


Lys 


Ser 


Gly 


Glu 


Ala Ala 






610 








615 
Met


Cys 


Gln


Trp 


Ala 


His 


Pro 


Ser 


Leu 


Gly 


Pro 


His 


Ser Glu 


Pro 


1 




5 










10 










15 


Ala 


Ser 


Pro 


Ser 


Val 
20 


Glu 


Tyr 


Ile


Arg 


Arg 
25 


Gln


Lys 


Gln


Pro 


Arg 
30 


Pro 


Pro 


Pro 


Ser 


Arg 
35 


Arg 


Arg 


Arg 


Pro 


Glu 
40 


Arg 


Val 


Trp 


Pro 


Asp 
45 


Pro 


Pro 


Glu 


Glu 


Lys 
50 


Ala 


Pro 


Ala 


Pro 


Ala 
55 


Pro 


Glu 


Glu 


Arg 


Ile
60 


Glu 


Pro 


Pro 


Val 


Lys 
65 


Pro 


Leu 


Leu 


Pro 


Pro 
70 


Leu 


Pro 


Pro 


Asp 


Tyr 
75 


Gly 


Asp 


Gly 


Tyr 


Val 


Ile


Pro 


Asn 


Tyr 


Asp 


Asp 


Met 


Asp 


Tyr 


Tyr 


80 










85 










90 


Phe 


Gly 


Pro 


Pro 


Pro 


Pro 


Gln


Lys 


Pro Asp Ala Glu Arg 


Gln


Thr 








95 










100 










105 


Asp 


Glu 


Glu 


Lys 


Glu 


Glu 


Leu 


Lys 


Lys 


Pro 


Lys 


Lys 


Glu 


Asp 


Ser 






110 










115 










120 


Ser 


Pro 


Lys 


Glu 


Glu 


Thr 


Asp 


Lys 


Trp 


Ala 


Val 


Glu 


Lys 


Gly 


Lys 








125 










130 










135 


Asp 


His 


Lys 


Glu 


Pro 


Arg 


Lys 


Gly Glu 


Glu 


Leu 


Glu 


Glu 


Glu 


Trp 










140 










145 










150 


Thr 


Pro 


Thr 


Glu 


Lys 


Val 


Lys 


Cys 


Pro 


Pro 


Ile


Gly Met 


Glu 


Ser 










155 










160 










165 


His 


Arg 


Ile


Glu 


Asp 


Asn 


Gln


Ile


Arg 


Ala 


Ser 


Ser 


Met 


Leu 


Arg 
170








His 


Gly 


Leu Gly


Ala 


Gln


Arg 


Gly 






185 








Ala 


Thr 


Glu Asp 


Asp 
200 


Tyr 


Tyr 


Asp 


Asp 


Ala 


Arg Thr 


Gln
215 


Trp 


Ile


Glu 


Arg 


Phe 


Thr Gly 


Val 
230 


Ile


Thr 


Gln


Asp 


Asp 


Phe Val 


Thr 
245 


Thr 


Phe 


Phe 


Gln


Thr 


Trp Val 


Met 
260 


Tyr 


Thr 


Asn 


Thr 


Met 


Pro Arg 


Leu 
275 


Leu 


Ala 


Leu 


Val 


Gly 


Ser Gln


Arg Gly 


Trp 


Gln








290 








Ser 


Pro 


Asp Leu 


Ser 
305 


Trp 


Glu 


Arg 


Glu 
Met


Glu 


Arg 


Asp 


Ser 


His 


Gly 


Asn 


1 








5 








Ser 


Ala 


Gly 


Ala 


Ser 
20 


Pro 


Ala 


Gln


Pro 


Gly Arg 


Ala 


Ser 


Pro 


Ala 


Gln










35 








Pro 


Ala 


Gly 


Thr 


Pro 
50 


Pro 


Gly 


Arg 


Pro 


Ala 


Gly 


Thr 


Pro 
65 


Pro 


Gly 


Arg 


Pro 


Ala 


Gln


Ala 


Ser 
80 


Pro 


Ala 


Arg 


Leu 


Ser 


Arg 


Ser 


Ser 
95 


Ser 


Gly 


Arg 


Ser 


Val 


Thr 


Thr 


Ser 
110 


Pro 


Thr 


Arg 


Pro 


Val 


Gly 


Ala 


Val 
125 


Pro 


Ile


Arg 


Pro 


Ala 


Thr 


Arg 


Ala 
140 


Thr 


Arg 


Glu 


Lys 


Phe 


Thr 


Trp 


Arg 
155 


Glu 


Gly 


Gln


Cys 


Val 


Leu 


Leu 


Leu 
170 


Ile


Ala 


Leu 


Phe 


Gln


Phe 


Trp 


Gln
185 


Gly 


His 


Thr 


Arg 


Glu 


Ser 


Cys 


Pro 
200 


Lys 


His 


Ala 


Asp 


Cys 


Lys 


Leu 


Lys 
215 


Ser 


Asp 


Glu 


Trp 


Asp 


Lys 


Ser 


Leu 


Leu 


Lys 


Ile





175 








180 


Arg 


Leu Asn 


Met 


Gln


Thr 


Gly 


190 








195 


Gly Ala Trp 


Cys 


Ala 


Glu 


Asp 




205 








210 


Val 


Asp Thr 


Arg Arg 


Thr 


Thr 




220 








225 


Gly Arg Asp 


Ser 


Ser 


Ile


His 




235 








240 


Val 


Gly Phe 


Ser 


Asn 


Asp 


Ser 




250 








255 


Gly 


Tyr Glu 


Glu 


Met 


Val 


Gly 


265 








270 


Leu 


Pro Leu 


Ser Gly Arg Gly 




280 








285 


Tyr 


Cys Ser 


Glu 


Ala 


Cys 


Leu 




295 








300 


Gly 


Gln Gly His


Thr 


Arg 


Ala 




310 








315 



Ala 


Ser 


Pro 


Ala 


Arg 


Thr 


Pro 




10 










15 


Ala 


Ser 


Pro 


Ala 


Gly Thr 


Pro 




25 










30 


Ala 


Ser 


Pro 


Ala 


Gln


Ala 


Ser 




40 










45 


Ala 


Ser 


Pro 


Ala 


Gln


Ala 


Ser 




55 










60 


Ala 


Ser 


Pro 


Gly Arg Ala 


Ser 




70 










75 


Ala 


Ser 


Pro 


Ala 


Leu 


Ala 


Ser 




85 










90 


Ser 


Ser 


Ser 


Ala 


Arg 


Ser 


Ala 




100 










105 


Val 


Tyr 


Leu 


Val 


Arg 


Ala 


Thr 




115 










120 


Ser 


Ser 


Pro 


Ala 


Arg 


Ser 


Ala 




130 










135 


Ser 


Pro 


Gly 


Thr 


Ser 


Leu 


Pro 




145 










150 


Lys 


Gln


Leu 


Pro 


Leu 


Ile


Gly 


160 










165 


Val 


Val 


Ser 


Leu 


Ile


Ile


Leu 




175 










180 


Gly 


Ile


Arg 


Tyr 


Lys 


Glu 


Gln


190 










195 


Val 


Arg 


Cys 


Asp 


Gly Val 


Val 




205 










210 


Leu 


Gly Cys 


Val 


Arg 


Phe 


Asp 




220 










225 


Tyr 


Ser 


Gly 


Ser 


Ser 


His 


Gln
230










235 










240 




Leu 


Pro 


Ile


Cys 
245 


Ser 


Ser 


Asn 


Trp 


Asn 
250 


Asp 


Ser 


Tyr 


Ser 


Glu 
255 


Lys 


Thr 


Cys 


Gln


Gln
260 


Leu 


Gly 


Phe 


Glu 


Ser 
265 


Ala 


His 


Arg 


Thr 


Thr 
270 


Glu 


Val 


Ala 


His 


Arg 
275 


Asp 


Phe 


Ala 


Asn 


Ser 
280 


Phe 


Ser 


Ile


Leu 


Arg 
285 


Tyr 


Asn 


Ser 


Thr 


Ile
290 


Gln


Glu 


Ser 


Leu 


His 
295 


Arg 


Ser 


Glu 


Cys 


Pro 
300 


Ser 


Gln


Arg 


Tyr 


Ile
305 


Ser 


Leu 


Gln


Cys 


Ser 
310 


His 


Cys 


Gly 


Leu 


Arg 
315 


Ala 


Met 


Thr 


Gly Arg 


Ile


Val 


Gly 


Gly Ala 


Leu 


Ala 


Ser 


Asp 


Ser 










320 










325 










330 


Lys 


Trp 


Pro 


Trp 


Gln
335 


Val 


Ser 


Leu 


His 


Phe 
340 


Gly 


Thr 


Thr 


His 


Ile
345 


Cys 


Gly 


Gly 


Thr 


Leu 
350 


Ile


Asp 


Ala 


Gln


Trp 
355 


Val 


Leu 


Thr 


Ala 


Ala 
360 


His 


Cys 


Phe 


Phe 


Val 
365 


Thr 


Arg 


Glu 


Lys 


Val 
370 


Leu 


Glu 


Gly 


Trp 


Lys 
375 


Val 


Tyr 


Ala 


Gly Thr 


Ser 


Asn 


Ser 


Tyr 


Pro 


Gly 


Pro 


Lys 


Ala 


Ser 










380 










385 










390 


Ala 


Gly 


Gln


Lys 


Ser 
395 


Lys 


Thr 


Leu 


Lys 


Asp 
400 


Pro 


Tyr 


Met 


Glu 


His 
405 


Phe 


Cys 


Phe 


Ile


Ile
410 


Arg 


Glu 


Thr 


Glu 


Ala 
415 


Gln


Gly 


Leu 
Met


Ala 


Ala 


Val 


Arg 


Gly 


Ala 


Pro 


Leu 


Leu 


Ser 


Cys 


Leu 


Leu 


Ala 


1 








5 










10 










15 


Leu 


Leu 


Ala 


Leu 


Cys 
20 


Pro 


Gly 


Gly 


Arg 


Pro 
25 


Gln


Thr 


Val 


Leu 


Thr 
30 


Asp 


Asp 


Glu 


Ile


Glu 
35 


Glu 


Phe 


Leu 


Glu 


Gly 
40 


Phe 


Leu 


Ser 


Glu 


Leu 
45 


Glu 


Pro 


Glu 


Pro 


Arg 

50 


Glu 


Asp 


Asp 


Val 


Glu 

55 


Ala 


Pro 


Pro 


Pro 


Pro 

60 


Glu 


Pro 


Thr 


Pro 


Arg 
65 


Val 


Arg 


Lys 


Ala 


Gln
70 


Ala 


Gly 


Gly 


Lys 


Pro 
75 


Gly 


Lys 


Arg 


Pro 


Gly 
80 


Thr 


Ala 


Ala 


Glu 


Val 
85 


Pro 


Pro 


Glu 


Lys 


Thr 
90 


Lys 


Asp 


Lys 


Gly 


Lys 
95 


Lys 


Gly 


Lys 


Lys 


Asp 
100 


Lys 


Gly 


Pro 


Lys 


Val 
105 


Pro 


Lys 


Glu 


Ser 


Leu
110 


Glu 


Gly 


Ser


Pro 


Arg 
115 


Pro 


Pro 


Lys 


Lys 


Gly 
120 


Lys 


Glu 


Lys 


Pro 


Pro 
125 


Lys 


Ala 


Thr 


Lys 


Lys 
130 


Pro 


Lys 


Glu 


Lys 


Pro 
135 


Pro 


Lys 


Ala 


Thr 


Lys 
140 


Lys 


Pro 


Lys 


Glu 


Lys 
145 


Pro 


Pro 


Lys 


Ala 


Thr 
150 


Lys 


Lys 


Pro 


Lys 


Glu 
155 


Lys 


Pro 


Pro 


Lys 


Ala 
160 


Thr 


Lys 


Lys 


Pro 


Pro 
165 


Ser 


Gly 


Lys 


Arg 


Pro 


Pro 


Ile


Leu 


Ala 


Pro 


Ser 


Glu 


Thr 


Leu 


Glu 








170 










175 










180 


Trp 


Pro 


Leu 


Pro 


Pro 
185 


Pro 


Pro 


Ser 


Pro 


Gly 
190 


Pro 


Glu 


Glu 


Leu 


Pro 
195 


Gln


Glu 


Gly 


Gly 


Ala 


Pro 


Leu 


Ser 


Asn 


Asn 


Trp 


Gln


Asn 


Pro 


Gly 
200






Glu 


Glu 


Thr 


His 


Val 
215 


Glu 


Ala 


Glu 


Glu 


Thr 


Glu 


Gln
230 


Pro 


Thr 


Arg 


Glu 


Asp 


Tyr 


Glu 
245 


Asp 


Phe 


Pro 


Arg 


Pro 


Pro 


Pro 
260 


Ser 


Arg 


Pro 


Glu 


Thr 


Pro 


Glu 
275 


Glu 


Lys 


Arg 


Ile


Glu 


Pro 


Pro 
290 


Val 


Lys 


Asp 


Tyr 


Gly 


Asp 


Gly 
305 


Tyr 


Val 


Tyr 


Tyr 


Phe 


Gly 


Pro 
320 


Pro 


Pro 


Gln


Thr 


Asp 


Glu 


Glu 
335 


Lys 


Glu 


Asp 


Ser 


Ser 


Pro 


Lys 
350 


Glu 


Glu 


Gly 


Lys 


Asp 


His 


Lys 
365 


Glu 


Pro 


Glu 


Trp 


Thr 


Pro 


Thr 
380 


Glu 


Lys 


Glu 


Ser 


His 


Arg 


Ile
395 


Glu 


Asp 


Leu 


Arg 


His 


Gly 


Leu 
410 


Gly 


Ala 


Thr 


Gly 


Ala 


Thr 


Glu 
425 


Asp 


Asp 


Glu 


Asp 


Asp 


Ala 


Arg 
440 


Thr 


Gln


Thr 


Thr 


Arg 


Phe 


Thr 
455 


Gly 


Val 


Ile


His 


Asp 


Asp 


Phe 
470 


Val 


Thr 


Asp 


Ser 


Gln


Thr 


Trp 
485 


Val 


Met 


Val 


Gly 


Thr 


Met 


Pro 
500 


Arg 


Leu 


Arg 


Gly 


Val 


Gly 


Ser 
515 


Gln


Arg 


Cys 


Leu 


Ser 


Pro 


Asp 
530 


Leu 


Ser 


Arg 


Ala 


Glu 















205 










210 


Arg 


Glu 


His 


Arg 


Pro 


Glu 


Pro 


Glu 






220 










225 


Leu 


Asp 


Tyr 


Asn Asp 


Gln


Ile


Glu 






235 










240 


Glu 


Tyr 


Ile


Arg 


Arg 


Gln


Lys 


Gln






250 










255 


Arg 


Arg 


Arg 


Pro 


Glu Arg Val 


Trp 






265 










270 


Ala 


Pro 


Ala 


Pro 


Ala 


Pro 


Glu 


Glu 






280 










285 


Pro 


Leu 


Leu 


Pro 


Pro 


Leu 


Pro 


Pro 






295 










300 


Ile


Pro 


Asn 


Tyr 


Asp Asp Met 


Asp 






310 










315 


Pro 


Gln


Lys 


Pro 


Asp 


Ala 


Glu 


Arg 






325 










330 


Glu 


Leu 


Lys 


Lys 


Pro 


Lys 


Lys 


Glu 






340 










345 


Thr 


Asp 


Lys 


Trp 


Ala 


Val 


Glu 


Lys 






355 










360 


Arg 


Lys 


Gly 


Glu 


Glu 


Leu 


Glu 


Glu 






370 










375 


Val 


Lys 


Cys 


Pro 


Pro 


Ile


Gly 


Met 






385 










390 


Asn 


Gln


Ile


Arg 


Ala 


Ser 


Ser 


Met 






400 










405 


Gln


Arg 


Gly 


Arg 


Leu 


Asn 


Met 


Gln






415 










420 


Tyr 


Tyr 


Asp 


Gly Ala 


Trp 


Cys 


Ala 






430 










435 


Trp 


Ile


Glu 


Val 


Asp 


Thr 


Arg 


Arg 






445 










450 


Ile


Thr 


Gln


Gly Arg Asp 


Ser 


Ser 






460 










465 


Thr 


Phe 


Phe 


Val 


Gly 


Phe 


Ser 


Asn 






475 










480 


Tyr 


Thr 


Asn 


Gly 


Tyr


Glu 


Glu 


Met 






490 










495 


Leu 


Ala 


Leu 


Leu 


Pro 


Leu 


Ser 


Gly 






505 










510 


Gly Trp Gln


Tyr 


Cys 


Ser 


Glu 


Ala 






520 










525 


Trp 


Glu 


Arg 


Gly Gln Gly His


Thr 






535 










540 



<210> 48 

<211> 742 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7510044CD1 

<400> 48 

Met Ala Thr Ala Gly Gly Gly Ser Gly Ala Asp Pro Gly Ser Arg 

1 5 10 15

Gly Leu Leu Arg Leu Leu Ser Phe Cys Val Leu Leu Ala Gly Leu 

20 25 30 

Phe Leu Asp Phe Glu Pro Asp Pro Leu Pro Leu Cys Pro Asp Val 
35








40 








45 


Ser 


Phe 


Phe 


Pro 


Gln Pro


Gln


Pro 


Pro 


Thr 


Pro 


Gln


Ser 


Lys Gly 








50 








55 








60 


Thr Gly Leu 


Cys 


Arg Gly Asn 


Ser 


Val 


Glu Arg 


Lys 


Ile


Tyr Ile










65 








70 








75 


Pro 


Leu 


Asn 


Lys 


Thr Ala 


Pro Cys Val 


Arg 


Leu 


Leu 


Asn 


Ala Thr 








80 








85 








90 


His 


Gln


Ile


Gly Cys Gln


Ser 


Ser 


Ile


Ser Gly Asp Thr Gly Val 










95 








100 








105 


Ile


His 


Val 


Val 


Glu Lys 


Glu 


Glu 


Asp 


Leu 


Gln Trp Val


Leu Thr 










110 








115 








120 


Asp Gly 


Pro 


Asn 


Pro Pro Tyr Met Val 


Leu 


Leu 


Glu 


Ser 


Lys His 










125 








130 








135 


Phe 


Thr 


Arg 


Asp 


Leu Met 


Glu


Lys 


Leu 


Lys 


Gly Arg Thr 


Ser Arg 






140 








145








150 


Ile


Ala 


Gly Leu 


Ala Val 


Ser 


Leu 


Thr


Lys 


Pro 


Ser 


Pro 


Ala Ser 










155 








160








165 


Gly 


Phe 


Ser 


Pro 


Ser Val 


Gln


Cys 


Pro 


Asn 


Asp 


Gly Phe Gly Val 








170 








175








180 


Tyr 


Ser 


Asn 


Ser 


Tyr Gly 


Pro 


Glu 


Phe 


Ala 


His 


Cys 


Arg 


Glu Ile








185 








190








195 


Gln


Trp 


Asn 


Ser 


Leu Gly 


Asn 


Gly 


Leu 


Ala 


Tyr 


Glu 


Asp 


Phe Ser 








200 








205








210 


Phe 


Pro 


Ile


Phe 


Leu Leu 
215 


Glu 


Asp 


Glu


Asn 
220 


Glu 


Thr 


Lys 


Val Ile
225 


Lys 


Gln


Cys 


Tyr 


Gln Asp


His 


Asn 


Leu 


O CL 


Gln


Asn 


Gly 


Ser Ala 




230 
















240 


Pro 


Thr 


Phe 


Pro 


Leu Cys 
245 


Ala 


Met 


Gln




Phe 


Ser 


His 


Met His 
255 


Ala 


Val 


Ile


Ser 


Thr Ala 
260 


Thr 


Cys 


Met 


265


Arg 


Arg 


Ser 


Ile Gln
270 


Ser 


Thr 


Phe 


Ser 


Ile Asn
275 


Pro 


Glu 


Ile


Val 
280 


Cys 


Asp 


Pro 


Leu Ser 
285 


Asp 


Tyr 


Asn 


Val 


Trp Ser 


Met 


Leu 


Lys 


Pro 


Ile


Asn 


Thr 


Thr Gly 






290 








295 








300 


Thr 


Leu 


Lys 


Pro Asp Asp Arg Val 


Val 


Val 


Ala 


Ala 


Thr 


Arg Leu 








305 








310 








315 


Asp 


Ser 


Arg 


Ser 


Phe Phe 


Trp 


Asn 


Val 


Ala 


Pro Gly Ala Glu Ser 






320 








325 








330 


Ala 


Val 


Ala 


Ser 


Phe Val 
335 


Thr 


Gln


Leu 


Ala 
340 


Ala 


Ala 


Glu 


Ala Leu 
345 


Gln


Lys 


Ala 


Pro 


Asp Val 


Thr 


Thr 


Leu 


Pro 


Arg 


Asn 


Val 


Met Phe 








350 








355 








360 


Val 


Phe 


Phe Gln Gly Glu


Thr 


Phe 


Asp 


Tyr 


Ile


Gly 


Ser 


Ser Arg 










365 








370 








375 


Met 


Val 


Tyr 


Asp 


Met Glu 


Lys 


Gly Lys 


Phe 


Pro 


Val 


Gln


Leu Glu 








380 








385 








390 


Asn 


Val 


Asp 


Ser 


Phe Val 


Glu 


Leu 


Gly Gln Val


Ala 


Leu 


Arg Thr 








395 








400 








405 


Ser 


Leu 


Glu 


Leu 


Trp Met 
410 


His 


Thr 


Asp 


Pro 

415 


Val 


Ser 


Gln


Lys Asn 
420 


Glu 


Ser 


Val 


Arg 


Asn Gln


Val 


Glu 


Asp 


Leu 


Leu 


Ala 


Thr 


Leu Glu 








425 








430 








435 


Lys 


Ser 


Gly Ala 


Gly Val 


Pro 


Ala 


Val 


Ile


Leu 


Arg 


Arg 


Pro Asn 








440 








445 








450 


Gln


Ser 


Gln


Pro 


Leu Pro 
455 


Pro 


Ser 


Ser 


Leu 
460 


Gln


Arg 


Phe 


Leu Arg 
465 


Ala 


Arg 


Asn 


Ile


Ser Gly Val 


Val 


Leu 


Ala 


Asp 


His 


Ser 


Gly Ala 








470 








475 








480 


Phe 


His 


Asn 


Lys 


Tyr Tyr 


Gln


Ser 


Ile


Tyr 


Asp 


Thr 


Ala 


Glu Asn 








485 








490 








495 


Ile


Asn 


Val 


Ser 


Tyr Pro 
500 


Glu 


Trp 


Leu 


Ser 
505 


Pro 


Glu 


Glu 


Asp Leu 
510 
ASXI


Phe 


Val 


Thr 


Asp 


Thr 


Ala 


Lys 


Ala 


Leu 


Ala 


Asp Val 


Ala 


Thr 










515 










520 










525 


Val 


Leu 


Gly 


Arg 


Ala 


Leu 


Tyr 


Glu 


Leu 


Ala 


Gly Gly Thr Asn 


Phe 










530 










535 










540 


Ser 


Asp 


Thr 


Val 


Gln


Ala 


Asp 


Pro 


Gln


Thr 


Val 


Thr 


Arg 


Leu 


Leu 








545 










550 










555 


Tyr 


Gly 


Phe 


Leu 


Ile


Lys 


Ala 


Asn 


Asn 


Ser 


Trp 


Phe 


Gln


Ser 


Ile










560 










565 










570 


Leu 


Arg 


Gln


Asp 


Leu 


Arg 


Ser 


Tyr 


Leu 


Gly 


Asp 


Gly 


Pro 


Leu 


Gln










575 










580 










585 


His 


Tyr 


Ile


Ala 


Val 


Ser 


Ser 


Pro 


Thr 


Asn 


Thr 


Thr 


Tyr 


Val 


Val 








590 










595 










600 


Gln


Tyr 


Ala 


Leu 


Ala 


Asn 


Leu 


Thr 


Gly 


Thr 


Val 


Val 


Asn 


Leu 


Thr 










605 










610 










615 


Arg 


Glu 


Gln


Cys 


Gln


Asp 


Pro 


Ser 


Lys 


Val 


Pro 


Ser 


Glu 


Asn 


Lys 








620 










625 










630 


Asp 


Leu 


Tyr 


Glu 


Tyr 


Ser 


Trp 


Val 


Gln


Gly 


Pro 


Leu 


His 


Ser 


Asn 










635 










640 










645 


Glu 


Thr


Asp 


Arg 


Leu 


Pro 


Arg 


Cys 


Val 


Arg 


Ser 


Thr 


Ala 


Arg 


Leu 










650 










655 










660 


Ala 


Arg 


Ala 


Leu 


Ser 


Pro 


Ala 


Phe 


Glu 


Leu 


Ser 


Gln


Trp 


Ser 


Ser 










665 










670 










675 


Thr 


Glu 


Tyr 


Ser 


Thr 


Trp 


Thr 


Glu 


Ser 


Arg 


Trp 


Lys 


Asp 


Ile


Arg 








680 










685 










690 


Ala 


Arg 


Ile


Phe 


Leu 


Ile


Ala 


Ser 


Lys 


Glu 


Leu 


Glu 


Leu 


Ile


Thr 










695 










700 










705 


Leu 


Thr 


Val 


Gly 


Phe 


Gly 


Ile


Leu 


Ile


Phe 


Ser 


Leu 


Ile


Val 


Thr 










710 










715 










720 


Tyr 


Cys 


Ile


Asn 


Ala 


Lys 


Ala 


Asp 


Val 


Leu 


Phe 


Ile


Ala 


Pro 


Arg 










725 










730 










735 


Glu 


Pro 


Gly 


Ala 


Val 


Ser 


Tyr 



















740 



<210> 49 
<211> 47 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7504509CD1 

<400> 49 

Met Ser Gly His Ser Pro Thr Arg Gly Ala Met Gln Val Ala Met 

1 5 10 15 

Asn Gly Lys Ala Arg Lys Glu Ala Val Gln Thr Ala Ala Lys Glu 

20 25 30 

Leu Leu Lys Phe Val Asn Arg Ser Pro Ser Pro Phe His Gly Trp 

35 40 45 

Leu Gln 



<210> 50 
<211> 74 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7506825CD1 

<400> 50 

Met Thr Ser Lys Gly Pro Glu Glu Glu His Pro Ser Val Thr Leu 

1 5 10 15 

Phe Arg Gln Tyr Leu Arg Ile Arg Thr Val Gln Pro Lys Pro Asp 

20 25 30 

Tyr Gly Gly Thr Trp Leu Cys Gly Asp Arg Val Asp Leu Ala Arg 

35 40 45 

His Gln Pro Tyr Thr Leu Leu His Leu Ala Gln Leu Pro His Gly 

50 55 60 

Cys Gly Ala Cys Leu Gln Gly Thr Leu Glu Ser Arg Pro Leu 

65 70 













<210> 51 
<211> 343 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7506828CD1 

<400> 51 



Met 


Thr 


Ser 


Lys 


Gly Pro 


Glu 


Glu 


Glu 


His 


Pro 


Ser 


Val 


Thr 


Leu 


1 






5 










10 










15 


Phe 


Arg 


Gln


Tyr 


Leu 


Arg 


Ile


Arg 


Thr 


Val 


Gln


Pro 


Lys 


Pro 


Asp 








20 










25 










30 


Tyr 


Gly 


Ala 


Ala 


Val 


Ala 


Phe 


Phe 


Glu 


Glu 


Thr 


Ala 


Arg 


Gln


Leu 






35 










40 










45 


Gly 


Leu 


Gly 


Cys 


Gln


Lys 


Val 


Glu 


Val 


Ala 


Pro Gly Tyr Val 


Val 








50 










55 










60 


Thr 


Val 


Leu 


Thr 


Trp 
65 


Pro 


Gly 


Thr 


Asn 


Pro 
70 


Thr 


Leu 


Ser 


Ser 


Ile
75 


Leu 


Leu 


Asn 


Ser 


His 
80 


Thr 


Asp 


Val 


Val 


Pro 

85 


Val 


Phe 


Lys 


Glu 


His 
90 


Trp 


Ser 


His 


Asp 


Pro 


Phe 


Glu 


Ala 


Phe 


Lys 


Asp 


Ser 


Glu 


Gly Tyr 








95 










100 










105 


Ile


Tyr 


Ala 


Arg 


Gly Ala 


Gln


Asp 


Met 


Lys 


Cys 


Val 


Ser 


Ile


Gln








110 










115 










120 


Tyr 


Leu 


Glu 


Ala 


Val 


Arg 


Arg 


Leu 


Lys 


Val 


Glu 


Gly His 


Arg 


Phe 








125 










130 










135 


Pro 


Arg 


Thr 


Ile


His 


Met 


Thr 


Phe 


Val 


Pro 


Asp 


Glu 


Glu 


Val 


Gly 








140 










145 










150 


Gly 


His 


Gln


Gly 


Met 


Glu 


Leu 


Phe 


Val 


Gln


Arg 


Pro 


Glu 


Phe 


His 






155 










160 










165 


Ala 


Leu 


Arg 


Ala 


Gly 


Phe 


Ala 


Leu 


Asp 


Glu 


Gly 


Ile


Ala 


Asn 


Pro 








170 










175 










180 


Thr 


Asp 


Ala 


Phe 


Thr 
185 


Val 


Phe 


Tyr 


Ser 


Glu 
190 


Arg 


Ser 


Pro 


Trp 


Trp 
195 


Val 


Arg 


Val 


Thr 


Ser 


Thr 


Gly 


Arg 


Pro 


Gly His 


Ala 


Ser 


Arg 


Phe 








200 










205 










210 


Met 


Glu 


Asp 


Thr 


Ala 


Ala 


Glu 


Lys 


Leu 


Ala 


Phe 


Glu 


Glu 


Gln


Leu 








215 










220 










225 


Gln


Ser 


Trp 


Cys 


Gln


Ala 


Ala 


Gly 


Glu 


Gly Val 


Thr 


Leu 


Glu 


Phe 






230 










235 










240 


Ala 


Gln


Lys 


Trp 


Met 


His 


Pro 


Gln


Val 


Thr 


Pro 


Thr 


Asp 


Asp 


Ser 






245 










250 










255 


Asn 


Pro 


Trp 


Trp 


Ala 


Ala 


Phe 


Ser 


Arg 


Val 


Cys 


Lys 


Asp 


Met 


Asn 








260 










265 










270 


Leu 


Thr 


Leu 


Glu 


Pro 
275 


Glu 


Ile


Met 


Pro 


Ala 
280 


Ala 


Thr 


Asp 


Asn 


Arg 
285 


Tyr 


Ile


Arg 


Ala 


Val 


Gly 


Val 


Pro 


Ala 


Leu 


Gly 


Phe 


Ser 


Pro 


Met 








290 










295 










300 


Asn 


Arg 


Thr 


Pro 


Val 


Leu 


Leu 


His 


Asp 


His 


Asp 


Glu 


Arg 


Leu 


His 








305 










310 










315 


Glu 


Ala 


Val 


Phe 


Leu 


Arg 


Gly 


Val 


Asp 


Ile


Tyr 


Thr 


Arg 


Leu 


Leu 
320 325 330 

Pro Ala Leu Ala Ser Val Pro Ala Leu Pro Ser Asp Ser 

335 340 



<210> 52 
<211> 373 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7506831CD1 

<400> 52 



Met 


Thr 


Ser 


Lys 


Gly 


Pro 


Glu 


Glu 


Glu 


His 


Pro 


Ser 


Val 


Thr 


Leu 


1 






5 










10 










15 


Phe 


Arg 


Gln


Tyr 


Leu 


Arg 


Ile


Arg 


Thr 


Val 


Gln


Pro 


Lys 


Pro 


Asp 










20 










25 










30 


Tyr 


Gly 


Thr 


Asn 


Pro 


Thr 


Leu 


Ser 


Ser 


Ile


Leu 


Leu 


Asn 


Ser 


His 








35 










40 










45 


Thr Asp 


Val 


Val 


Pro 


Val 


Phe 


Lys 


Glu 


His 


Trp 


Ser 


His 


Asp 


Pro 










50 










55 










60 


Phe 


Glu 


Ala 


Phe 


Lys 


Asp 


Ser 


Glu 


Gly Tyr 


Ile


Tyr 


Ala 


Arg 


Gly 










65 










70 










75 


Ala 


Gln


Asp 


Met 


Lys 


Cys 


Val 


Ser 


Ile


Gln


Tyr 


Leu 


Glu 


Ala 


Val 








80 










85 










90 


Arg 


Arg 


Leu 


Lys 


Val 


Glu 


Gly 


His 


Arg 


Phe 


Pro 


Arg 


Thr 


Ile


His 










95 










100 










105 


Met 


Thr 


Phe 


Val 


Pro 


Asp 


Glu 


Glu 


Val 


Gly 


Gly His 


Gln Gly


Met 










110 










115 










120 


Glu 


Leu 


Phe 


Val 


Gln


Arg 


Pro 


Glu 


Phe 


His 


Ala 


Leu 


Arg 


Ala 


Gly 










125 










130 










135 


Phe 


Ala 


Leu 


Asp 


Glu 


Gly 


Ile


Ala 


Asn 


Pro 


Thr 


Asp 


Ala 


Phe 


Thr 










140 










145 










150 


Val 


Phe 


Tyr 


Ser 


Glu 


Arg 


Ser 


Pro 


Trp 


Trp 


Val 


Arg 


Val 


Thr 


Ser 










155 










160 










165 


Thr 


Gly 


Arg 


Pro 


Gly 


His 


Ala 


Ser 


Arg 


Phe 


Met 


Glu 


Asp 


Thr 


Ala 










170 










175 










180 


Ala 


Glu 


Lys 


Leu 


His 


Lys 


Val 


Val 


Asn 


Ser 


Ile


Leu 


Ala 


Phe 


Arg 










185 










190 










195 


Glu 


Lys 


Glu 


Trp 


Gln


Arg 


Leu 


Gln


Ser 


Asn 


Pro 


His 


Leu 


Lys 


Glu 








200 










205 










210 


Gly 


Ser 


Val 


Thr 


Ser 


Val 


Asn 


Leu 


Thr 


Lys 


Leu 


Glu 


Gly 


Gly 


Val 








215 










220 










225 


Ala 


Tyr 


Asn 


Val 


Ile


Pro 


Ala 


Thr 


Met 


Ser 


Ala 


Ser 


Phe 


Asp 


Phe 








230 










235 










240 


Arg 


Val 


Ala 


Pro 


Asp 


Val 


Asp 


Phe 


Lys 


Ala 


Phe 


Glu 


Glu 


Gln


Leu 










245 










250 










255 


Gln


Ser 


Trp 


Cys 


Gln


Ala 


Ala 


Gly Glu 


Gly 


Val 


Thr 


Leu 


Glu 


Phe 










260 










265 










270 


Ala 


Gln


Lys 


Trp 


Met 


His 


Pro 


Gln


Val 


Thr 


Pro 


Thr 


Asp 


Asp 


Ser 








275 










280 










285 


Asn 


Pro 


Trp 


Trp 


Ala 


Ala 


Phe 


Ser 


Arg 


Val 


Cys 


Lys 


Asp 


Met 


Asn 










290 










295 










300 


Leu 


Thr 


Leu 


Glu 


Pro 


Glu 


Ile


Met 


Pro 


Ala 


Ala 


Thr 


Asp 


Asn 


Arg 










305 










310 










315 


Tyr 


Ile


Arg 


Ala 


Val 


Gly 


Val 


Pro 


Ala 


Leu 


Gly 


Phe 


Ser 


Pro 


Met 








320 










325 










330 


Asn Arg 


Thr 


Pro 


Val 


Leu 


Leu 


His 


Asp His 


Asp 


Glu 


Arg 


Leu 


His 










335 










340 










345 


Glu 


Ala 


Val 


Phe 


Leu 


Arg 


Gly 


Val 


Asp 


Ile


Tyr 


Thr 


Arg 


Leu 


Leu 










350 










355 










360 


Pro 


Ala 


Leu 


Ala 


Ser 


Val 


Pro 


Ala 


Leu 


Pro 


Ser 


Asp 


Ser 
<400> 53 



Met 


Trp


Pro Leu 


Ala 


Leu 


Val 


Ile


Ala 


Ser 


Leu Thr 


Leu 


Ala 


Leu 


1 




5 










10 








15 


Ser 


Gly 


Gly Val 


Ser 


Gln


Glu 


Ser 


Ser 


Lys 


Val Leu 


Asn 


Thr 


Asn 






20 










25 








30 


Gly 


Thr 


Ser Gly 


Phe 


Leu 


Pro 


Gly Gly 


Tyr 


Thr Cys 


Phe 


Pro 


His 






35 










40 








45 


Ser 


Gln


Pro Trp 


Gln


Ala 


Ala 


Leu 


Leu 


Val 


Gln Gly Arg


Leu 


Leu 








50 










55 








60 


Cys 


Gly 


Gly Val 


Leu 


Val 


His 


Pro 


Lys 


Trp 


Val Leu 


Thr 


Ala 


Ala 


65 










70 








75 


His 


Cys 


Leu Lys 


Glu 


Gly 


Leu 


Lys 


Val 


Tyr 


Leu Gly Lys 


His 


Ala 








80 










85 








90 


Leu 


Gly 


Arg Val 


Glu 


Ala 


Gly 


Glu 


Gln


Val 


Arg Glu 


Val 


Val 


His 




95 










100 








105 


Ser 


Ile


Pro His 


Pro 


Glu 


Tyr 


Arg 


Arg 


Ser 


Pro Thr 


His 


Leu 


Asn 








110 










115 








120 


His 


Asp 


His Asp 


Ile


Met 


Leu 


Leu 


Glu 


Leu 


Gln Ser


Pro 


Val 


Gln






125 










130 








135 


Leu 


Thr 


Gly Tyr 


Ile


Gln


Thr 


Leu 


Pro 


Leu 


Ser His 


Asn 


Asn 


Arg 






140 










145 








150 


Leu 


Thr 


Pro Gly 


Thr 


Thr 


Cys 


Arg 


Val 


Ser 


Gly Trp 


Gly Thr 


Thr 






155 










160 








165 


Thr 


Ser 


Pro Gln


Gly 


Met 


His 


Pro 


His 


Arg 


Trp Pro 


Glu 


Ala 


Pro 



170 175 180 
<400> 54 



Met 


Asp 


Phe 


Ser 


Arg 


Asn 


Leu 


Tyr 


Asp 


Ile


Gly Glu 


Gln


Leu 


Asp 


1 






5 










10 










15 


Ser 


Glu 


Asp 


Leu 


Ala 


Ser 


Leu 


Lys 


Phe 


Leu 


Ser 


Leu 


Asp 


Tyr 


Ile








20 










25 










30 


Pro 


Gln


Arg 


Lys 


Gln


Glu 


Pro 


Ile


Lys 


Asp 


Ala 


Leu 


Met 


Leu 


Phe 






35 










40 










45 


Gln


Arg 


Leu 


Gln


Glu 


Lys 


Arg 


Met 


Leu 


Glu 


Glu 


Ser 


Asn 


Leu 


Ser 








50 










55 










60 


Phe 


Leu 


Lys 


Glu 


Leu 


Leu 


Phe 


Arg 


Ile


Asn 


Arg 


Leu 


Asp 


Leu 


Leu 








65 










70 










75 


Ile


Thr 


Tyr 


Leu 


Asn 


Thr 


Arg 


Lys 


Glu 


Glu 


Met 


Glu 


Arg 


Glu 


Leu 








80 










85 










90 


Gln


Thr 


Pro 


Gly 


Arg 


Ala 


Gln


Ile


Ser 


Ala 


Tyr 


Arg 


Phe 


His 


Phe 








95 










100 










105 


Cys 


Arg 


Met 


Ser 


Trp 


Ala 


Glu 


Ala 


Asn 


Ser 


Gln


Cys 


Gln


Thr 


Gln
110






Ser 


Val 


Pro 


Phe 


Trp Arg 


Arg 










125 






Met 


Leu 


Tyr 


Gln


Ile
140 


Ser 


Glu 


Ser 


Phe 


Lys 


Phe 


Leu 
155 


Leu 


Gln


Asp 


Asp 


Asp Met 


Asn 


Leu 


Leu 










170 






Arg 


Val 


Ile


Leu 


Gly Glu 


Gly 










185 






Cys 


Ala 


Gln


Ile


Asn 
200 


Lys 


Ser 


Glu 


Glu 


Phe 


Ser 


Lys 
215 


Gly 


Glu 


Ser 


Asp 


Ser 


Pro 


Arg 
230 


Glu 


Gln


Lys


Val 


Tyr 


Gln


Met 
245 


Lys 


Ser 


Ile


Asn 


Asn 


His 


Asn 
260 


Phe 


Ala 


Leu 


His 


Ser 


Ile


Arg 
275 


Asp 


Arg 


Thr 


Val 


Glu 


Pro 


Lys 
290 


Arg 


Glu 







115 








120 


Val 


Asp 


His 


Leu 


Leu 


Ile Arg Val






130 








135 


Glu 


Val 


Ser 


Arg 


Ser 


Glu 


Leu Arg 






145 








150 


Glu 


Glu 


Ile


Ser 


Lys 


Cys 


Lys Leu 






160 








165 


Asp 


Ile


Phe 


Ile


Glu 


Met 


Glu Lys 






175 








180 


Lys 


Leu 


Asp 


Ile


Leu 


Lys 


Arg Val 






190 








195 


Leu 


Leu 


Lys 


Ile


Ile


Asn Asp Tyr 






205 








210 


Glu 


Leu 


Cys 


Gly Val 


Met 


Thr Ile






220 








225 


Asp 


Ser 


Glu 


Ser 


Gln


Thr 


Leu Asp 






235 








240 


Lys 


Pro 


Arg 


Gly Tyr 


Cys 


Leu Ile






250 








255 


Lys 


Ala 


Arg 


Glu 


Lys 


Val 


Pro Lys 






265 








270 


Asn 


Gly 


Thr 


His 


Leu 


Asp 


Ala Gly 






280 








285 


Lys 
<400> 55 


























Met 


Asp 


Phe 


Ser 


Arg 


Asn 


Leu 


Tyr 


Asp 


Ile


Gly Glu Gln Leu Asp


1 








5 










10 










15 


Ser 


Glu 


Asp 


Leu 


Ala 


Ser 


Leu 


Lys 


Phe 


Leu 


Ser 


Leu 


Asp 


Tyr 


Ile










20 










25 










30 


Pro 


Gln


Arg 


Lys 


Gln


Glu 


Pro 


Ile


Lys 


Asp 


Ala 


Leu 


Met 


Leu 


Phe 










35 










40 










45 


Gln


Arg 


Leu 


Gln


Glu 


Lys 


Arg 


Met 


Leu 


Glu 


Glu 


Ser 


Asn 


Leu 


Ser 








50 










55 










60 


Phe 


Leu 


Lys 


Glu 


Leu 


Leu 


Phe 


Arg 


Ile


Asn 


Arg 


Leu 


Asp 


Leu 


Leu 










65 










70 










75 


Ile


Thr 


Tyr 


Leu 


Asn 


Thr 


Arg 


Lys 


Glu 


Glu 


Met 


Glu 


Arg 


Glu 


Leu 








80 










85 










90 


Gln


Thr 


Pro 


Gly 


Arg 


Ala 


Gln


Ile


Ser 


Ala 


Tyr 


Arg 


Phe 


His 


Phe 








95 










100 










105 


Cys 


Arg 


Met 


Ser 


Trp 


Ala 


Glu 


Ala 


Asn 


Ser 


Gln Cys


Gln


Thr 


Gln






110 










115 










120 


Ser 


Val 


Pro 


Phe 


Trp 


Arg 


Arg 


Val 


Asp 


His 


Leu 


Leu 


Ile Arg Val










125 










130 










135 


Met 


Leu 


Tyr 


Gln


Ile


Ser 


Glu 


Glu 


Val 


Ser 


Arg 


Ser 


Glu 


Leu 


Arg 








140 










145 










150 


Ser 


Phe 


Lys 


Phe 


Leu 


Leu 


Gln


Glu 


Glu 


Ile


Ser 


Lys 


Cys 


Lys 


Leu 








155 










160 










165 


Asp 


Asp 


Asp 


Met 


Asn 


Leu 


Leu 


Asp 


Ile


Phe 


Ile


Glu 


Met 


Glu 


Lys 






170 










175 










180 


Arg 


Val 


Ile


Leu 


Gly 


Glu 


Gly 


Lys 


Leu 


Asp 


Ile


Leu 


Lys 


Arg 


Val 








185 










190 










195 


Cys 


Ala 


Gln


Ile


Asn 


Lys 


Ser 


Leu 


Leu 


Lys 


Ile


Ile


Asn 


Asp 


Tyr 
200



Glu 


Glu 


Phe 


Ser 


Lys 


Glu 


Arg 


Ser 










215 








Asp 


Glu 


Phe 


Ser 


Asn 


Gly Glu 


Glu 










230 








Ser 


Asp 


Ser 


Pro Arg Glu Gln Asp










245 








Lys 


Val 


Tyr 


Gln


Met 


Lys 


Ser 


Lys 










260 








Ile


Asn 


Asn 


His 


Asn 


Phe 


Ala 


Lys 










275 








Leu 


His 


Ser 


Ile


Arg 


Asp 


Arg 


Asn 










290 








Ala 


Leu 


Thr 


Thr 


Thr 


Phe 


Glu 


Glu 










305 








His 


Asp 


Asp 


Cys 


Thr 


Val 


Glu 


Gln










320 








Tyr 


Gln


Leu 


Met 


Asp 


His 


Ser 


Asn 










335 








Ile


Leu 


Ser 


His 


Gly Asp 


Lys 


Gly 










350 








Gln


Glu 


Ala 


Pro 


Ile


Tyr 


Glu 


Leu 










365 








Lys 


Cys 


Pro 


Ser Leu Ala Gly Lys 










380 








Ala 


Cys 


Gln


Gly Asp 


Asn 


Tyr 


Gln










395 








Asp 


Ser 


Glu 


Glu 


Gln


Pro 


Tyr 


Leu 










410 








Gln


Thr 


Arg 


Tyr 


Ile


Pro 


Asp 


Glu 










425 








Ala 


Thr 


Val 


Asn 


Asn 


Cys 


Val 


Ser 










440 








Thr 


Trp 


Tyr 


Ile


Gln


Ser 


Leu 


Cys 










455 








Pro 


Arg 


Gly 


Asp 


Asp 


Ile


Leu 


Thr 










470 








Glu 


Val 


Ser 


Asn 


Lys 


Asp 


Asp 


Lys 










485 








Pro 


Gln


Pro 


Thr 


Phe 


Thr 


Leu 


Arg 



500 

Asp 
<400> 56 

Met Ala Gln Lys Glu Gly Gly Arg 

1 5 
Pro Lys Val Ala Ala Leu Thr Ala 
20 

Ala Ile Gly Ala Ala Ser Trp Ala 
35 

Ala Ser Asp Leu Pro Trp Pro Leu 

50 

Ser Pro Leu Pro Ser Leu Gln Trp 





205 










210 


Ser 


Ser 


Leu 


Glu 


Gly 


Ser 


Pro 




220 










225 


Leu 


Cys 


Gly Val 


Met 


Thr 


Ile




235 










240 


Ser 


Glu 


Ser 


Gln


Thr 


Leu 


Asp 




250 










255 


Pro 


Arg 


Gly Tyr Cys 


Leu 


Ile




265 










270 


Ala 


Arg 


Glu 


Lys 


Val 


Pro 


Lys 




280 










285 


Gly 


Thr 


His 


Leu 


Asp 


Ala 


Gly 




295 










300 


Leu 


His 


Phe 


Glu 


Ile


Lys 


Pro 




310 










315 


Ile


Tyr 


Glu 


Ile


Leu 


Lys 


Ile




325 










330 


Met 


Asp 


Cys 


Phe 


Ile


Cys 


Cys 




340 










345 


Ile


Ile


Tyr Gly Thr Asp 


Gly 




355 










360 


Thr 


Ser 


Gln


Phe 


Thr 


Gly 


Leu 




370 










375 


Pro 


Lys 


Val 


Phe 


Phe 


Ile


Gln




385 










390 


Lys 


Gly 


Ile


Pro 


Val 


Glu 


Thr 




400 










405 


Glu 


Met 


Asp 


Leu 


Ser 


Ser 


Pro 




415 










420 


Ala 


Asp 


Phe 


Leu 


Leu 


Gly 


Met 




430 










435 


Tyr 


Arg 


Asn 


Pro 


Ala 


Glu 


Gly 




445 










450 


Gln


Ser 


Leu 


Arg 


Glu 


Arg 


Cys 




460 










465 


Ile


Leu 


Thr 


Glu 


Val 


Asn 


Tyr 




475 










480 


Lys 


Asn 


Met 


Gly Lys 


Gln


Met 




490 










495 


Lys 


Lys 


Leu Val 


Phe 


Pro 


Ser 




505 










510 



Thr 


Val 


Pro 


Cys 


Cys 


Ser 


Arg 




10 










15 


Gly 


Thr 


Leu 


Leu 


Leu 


Leu 


Thr 




25 










30 


Ile


Gly 


Glu 


Ser 


Gly 


Ser 


Cys 




40 










45 


Gln


Pro 


Pro 


Ala 


Cys 


Leu 


Pro 




55 










60 


Leu 


Phe 


Ser 


Ser 


Gly Val 


Thr 
65










70 










75 


Arg Ser Arg Cys 


Thr 


Gln


Cys 


Lys 


Thr 


Trp 


Arg 


Leu 


Leu 


Cys 


Ser 








80 










85 










90 


Ser Arg 


Ser 


Asn 


Ala 


Arg 


Val 


Ala 


Gly Leu 


Ser 


Cys 


Glu 


Glu 


Met 






95 










100 










105 


Gly Phe 


Leu 


Arg 


Ala 


Leu 


Thr 


His 


Ser 


Glu 


Leu 


Asp 


Val 


Arg 


Thr 






110 










115 










120 


Ala Gly Ala Asn 


Gly Thr 


Ser 


Gly Phe Phe Cys Val Asp Glu Gly 








125 










130 










135 


Arg Leu 


Pro 


His 


Thr 


Gln


Arg 


Leu 


Leu 


Glu Val


Ile


Ser 


Val 


Cys 






140 










145 










150 


Asp Cys 


Pro 


Arg 


Gly Arg 


Phe 


Leu 


Ala 


Ala 


He 


Cys 


Gin Asp 


Cys 








155 










160 










165 


Gly Arg Arg Lys 


Leu 


Pro 


Val 


Aso 


Arg 


He 


Val 


Gly Gly Arg 


Asp 








170 










175 










180 


Thr Ser 


Leu Gly Arg 


Trp 


Pro 


Trp 


Gin 


Val 


Ser 


Leu 


Arg 


Tvr 


Asp 








185 










190 










195 


Gly Ala 


His 


Leu 


Cys 


Gly Gly 


Ser 


Leu 


Leu 


Ser 


Gly Asp 


Trp 


Val 






200 










205 










210 


Leu Thr 


Ala 


Ala 


His 


Cys 


Phe 


Pro 


Glu 


Arg 


Asn 


Arg 


Val 


Leu 


Ser 








215 










220 










225 


Arg Trp 


Arg 


Val 


Phe 


Ala 


Gly 


Ala 


Val 


Ala 


Gin 


Ala 


Ser 


Pro 


His 






230 










235 










240 


Gly Leu 


Gin 


Leu 


Gly Val 


Gin 


Ala 


Val 


Val 


Tyr 


His 


Gly 


Glv 


Tyr 






245 










250 










255 


Leu Pro 


Phe 


Arg 


Asp 


Pro 


Asn 


Ser 


Glu 


Glu 


Asn 


Ser 


Asn 


Asp 


He 








260 










265 










270 


Ala Leu 


Val 


His 


Leu 


Ser 


Ser 


Pro 


Leu 


Pro 


Leu 


Thr 


Glu 


Tyr 


He 








275 










280 










285 


Gin Pro 


Val 


Cys 


Leu 


Pro 


Ala 


Ala 


Gly Gin Ala 


Leu 


Val 


Asp 


Gly 








290 










295 










300 


Lys lie 


Cys 


Thr 


Val 


Thr 


Gly 


Trp Gly Asn Thr Gin 


Tyr 


Tyr 


Gly 






305 










310 










315 


Gin Gin Ala Gly Val 


Leu 


Gin 


Glu Ala Arg Val 


Pro 


He 


He 


Ser 








320 










325 










330 


Asn Asp 


Val 


Cys 


Asn Gly Ala 


Asp 


Phe 


Tyr 


Gly Asn 


Gin 


He 


Lys 






335 










340 










345 


Pro Lys 


Met 


Phe 


Cys 


Ala 


Gly 


Tyr 


Pro Glu Gly Gly 


He 


Asp 


Ala 






350 










355 










360 


Cys Gin 


Gly Asp 


Ser 


Gly Gly 


Pro 


Phe 


Val 


Cys 


Glu 


Asp 


Ser 


He 






365 










370 










375 


Ser Arg 


Thr 


Pro 


Arg 


Trp 


Arg 


Leu 


Cys 


Gly 


He 


Val 


Ser 


Trp 


Gly 






380 










385 










390 


Thr Gly Cys Ala 


Leu 


Ala 


Gin 


Lys 


Pro 


Gly Val 


Tyr 


Thr 


Lys 


Val 








395 










400 










405 


Ser Asp 


Phe 


Arg 


Glu 


Trp 


lie 


Phe 


Gin 


Ala 


He 


Lys 


Thr 


His 


Ser 






410 










415 










420 


Glu Ala 


Ser Gly Met 


Val 


Thr 


Gin 


Leu 




















425 
Met Ala Asn Val Gly Leu Gln Phe Gln Ala Ser Ala Gly Asp Ser
1               5                   10                  15
Asp Pro Gln Ser Arg Pro Leu Leu Leu Leu Gly Gln Leu His His
                20                  25                  30



Leu His Arg Val Pro Trp Ser His Val Arg Gly Lys Leu Gln Pro
                35                  40                  45
Arg Val Thr Glu Glu Leu Trp Gln Ala Ala Leu Ser Thr Leu Asn
                50                  55                  60
Pro Asn Pro Thr Asp Ser Cys Pro Leu Tyr Leu Asn Tyr Ala Thr
                65                  70                  75
Val Ala Ala Leu Pro Cys Arg Val Ser Arg His Asn Ser Pro Ser
                80                  85                  90
Ala Ala His Phe Ile Thr Arg Leu Val Arg Thr Cys Leu Pro Pro
                95                  100                 105
Gly Ala His Arg Cys Ile Val Met Val Cys Glu Gln Pro Glu Val
                110                 115                 120
Phe Ala Ser Ala Cys Ala Leu Ala Arg Ala Phe Pro Leu Phe Thr
                125                 130                 135
His Arg Ser Gly Ala Ser Arg Arg Leu Glu Lys Lys Thr Val Thr
                140                 145                 150
Val Glu Phe Phe Leu Val Gly Gln Asp Asn Gly Pro Val Glu Val
                155                 160                 165
Ser Thr Leu Gln Cys Leu Ala Asn Ala Thr Asp Gly Val Arg Leu
                170                 175                 180
Ala Ala Arg Ile Val Asp Thr Pro Cys Asn Glu Met Asn Thr Asp
                185                 190                 195
Thr Phe Leu Glu Glu Ile Asn Lys Val Gly Lys Glu Leu Gly Ile
                200                 205                 210
Ile Pro Thr Ile Ile Arg Asp Glu Glu Leu Lys Thr Arg Gly Phe
                215                 220                 225
Gly Gly Ile Tyr Gly Val Gly Lys Ala Ala Leu His Pro Pro Ala
                230                 235                 240
Leu Ala Val Leu Ser His Thr Pro Asp Gly Ala Thr Gln Thr Ile
                245                 250                 255
Ala Trp Val Gly Lys Gly Ile Val Tyr Asp Thr Gly Gly Leu Ser
                260                 265                 270
Ile Lys Gly Lys Thr Thr Met Pro Gly Met Lys Arg Asp Cys Gly
                275                 280                 285
Gly Ala Ala Ala Val Leu Gly Ala Phe Arg Ala Ala Ile Lys Gln
                290                 295                 300
Gly Phe Lys Asp Asn Leu His Ala Val Phe Cys Leu Ala Glu Asn
                305                 310                 315
Ser Val Gly Pro Asn Ala Thr Arg Pro Asp Asp Ile His Leu Leu
                320                 325                 330
Tyr Ser Gly Lys Thr Val Glu Ile Asn Asn Thr Asp Ala Glu Gly
                335                 340                 345
Arg Leu Val Leu Ala Asp Gly Val Ser Tyr Ala Cys Lys Asp Leu
                350                 355                 360
Gly Ala Asp Ile Ile Leu Asp Met Ala Thr Leu Thr Gly Ala Gln
                365                 370                 375
Val Ser Ala Pro Trp Ile His Pro Leu Ala Val Val Pro Gly Asn
                380                 385                 390
Pro Thr Pro Leu Leu Thr Ser Arg Trp Gly Arg Gln Arg Val Gly
                395                 400                 405
Ala Ala Val Arg Lys Phe Ser
                410

<210> 58 

<211> 123 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7510392CD1 
<400> 58



Met Ala Val Val Pro Leu Leu Leu Leu Gly Gly Leu Trp Ser Ala
1               5                   10                  15
Val Gly Ala Ser Ser Leu Gly Val Val Thr Cys Gly Ser Val Val
                20                  25                  30
Lys Leu Leu Asn Thr Arg His Asn Val Arg Leu His Ser His Asp
                35                  40                  45
Val Arg Tyr Gly Ser Gly Ser Gly Gln Gln Ser Val Thr Gly Val
                50                  55                  60
Thr Ser Val Asp Asp Ser Asn Ser Tyr Trp Arg Ile Arg Gly Lys
                65                  70                  75
Ser Ala Thr Val Cys Glu Arg Gly Thr Pro Ile Lys Cys Gly Gln
                80                  85                  90
Pro Ile Arg Leu Thr His Val Asn Thr Gly Arg Asn Leu His Ser
                95                  100                 105
His His Phe Thr Ser Pro Leu Ser Gly Asn Gln Leu Leu Cys Lys
                110                 115                 120
Val Ile Leu



<210> 59 
<211> 1389 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7313196CB1 



<400> 59 

ggacggcagg accaagaagc agctcgccat ggtggaagga accatgaact ggcatctccc 60
cctcttcctc ttggcctctg tgacgctgcc ttccatctgc tcccacttca atcctctgtc 120 
tctcgaggaa ctaggctcca acacggggat ccaggttttc aatcagattg tgaagtcgag 180 
gcctcatgac aacatcgtga tctctcccca tgggattgcg tcggtcctgg ggatgcttca 240 
gctgggggcg gacggcagga ccaagaagca gctcgccatg gtgatgagat acggcgtaaa 300 
tgatatgatt gacaatctgc tgtccccaga tcttattgat ggtgtgctca ccagactggt 360 
cctcgtcaac gcagtgtatt tcaagggtct gtggaaatca cggttccaac ccgagaacac 420 
aaagaaacgc actttcgtgg cagccgacgg gaaatcctat caagtgccaa tgctggccca 480
gctctccgtg ttccggtgtg ggtcgacaag tgcccccaat gatttatggt acaacttcat 540 
tgaactgccc taccacgggg aaagcatcag catgctgatt gcactgccga ctgagagctc 600 
cactccgctg tctgccatca tcccacacat cagcaccaag accatagaca gctggatgag 660 
catcatggtg cccaagaggg tgcaggtgat cctgcccaag ttcacagctg tagcacaaac 720 
agatttgaag gagccgctga aagttcttgg cattactgac atgtttgatt catcaaaggc 780 
aaattttgca aaaataacaa ggtcagaaaa cctccatgtt tctcatatct tgcaaaaagc 840 
aaaaattgaa gtcagtgaag atggtaccaa agcttcagca gcaacaactg caattctcat 900 
tgcaagatca tcgcctccct ggtttatagt agacagacct tttctgtttt tcatccgaca 960 
taatcctaca ggtgctgtgt tattcatggg gcagataaac aaaccctgaa gagtatacaa 1020 
gagaaaccat gcaaagcaac gactactttg ctacgaagaa agactccttt cctgcatctc 1080 
gcttcttttt caaaactagt tcttaggaac agactcgatg caagtgtttc tgttctggga 1140 
ggtattggag ggaaaaaaca agcaggatgg ctggaacact gtactgagga atgaatagaa 1200 
aggcttccag atgtctaaaa gattctttaa actactgaac tgttacctag ggtaacaacc 1260 
cctgtgagta tttgctgttt gtcgggtcag gaattttgtt ttgtttgtct atatgtgcgg 1320
ctttacagag aatttatcag tgtacagacc aatatgttta tgtagcttta cttttatgaa 1380 
aaaaattat 1389 



<210> 60 

<211> 4131 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 6465289CB1
<400> 60

ggcggcggcg gcggagccct gggtcggtgt ctgcgcgctg gtgtctgagg cccaggctga 60
ggcctccgct attgctggag cgcaggcggc ggagaggatg actgccgctg ccattctctc 120
ttgagctagc gagccgccgc caccctccac cctcccccgg cagggcggag aggagcggcc 180
ggagtcagcg atggtgcccg gcgaggagaa ccaactggtc ccgaaagaga tagaaaatgc 240
tgctgaagaa cctagagtct tatgtattat acaagatact actaattcaa agacagtgaa 300
tgaacggatc actttaaatt taccagcatc tactccagtc agaaagctct ttgaagatgt 360
ggccaacaaa gtaggctaca taaatggaac ctttgacttg gtgtggggaa atggaatcaa 420
tactgctgat atggcaccac tggatcatac cagtgacaag tcacttctcg acgctaattt 480
tgagccagga aagaagaact ttctgcattt gacagataaa gatggtgaac aacctcaaat 540
actgctggag gattccagtg ctggggaaga cagtgttcat gacaggttta taggtccgct 600
tccaagagaa ggttctgtgg gttctaccag tgattatgtc agccaaagct actcctactc 660
atctattttg aataaatcag aaactggata tgtgggacta gtaaaccaag caatgacttg 720
ctatttgaat agccttttgc aaacactttt tatgactcct gaatttagga atgcattata 780
taagtgggaa tttgaagaat ctgaagaaga tccagtgaca agtattccat accaacttca 840
aaggcttttt gttttgttac aaaccagcaa aaagagagca attgaaacca cagatgttac 900
aaggagcttt ggatgggata gtagtgaggc ttggcagcag catgatgtac aagaactatg 960
cagagtcatg tttgatgctt tggaacagaa atggaagcaa acagaacagg ctgatcttat 1020
aaatgagcta tatcaaggca agctgaagga ctacgtgaga tgtctggaat gtggttatga 1080
gggctggcga atcgacacat atcttgatat tccattggtc atccgacctt atgggtccag 1140
ccaagcattt gctagtgtgg aagaagcatt gcatgcattt attcagccag agattctgga 1200
tggcccaaat cagtattttt gtgaacgttg taagaagaag tgtgatgcac ggaagggcct 1260
tcggtttttg cattttcctt atctgctgac cttacagctg aaaagattcg attttgatta 1320
tacaaccatg cataggatta aactgaatga tcgaatgaca tttcccgagg aactagatat 1380
gagtactttt attgatgttg aagatgagaa atctcctcag actgaaagtt gcactgacag 1440
tggagcagaa aatgaaggta gttgtcacag tgatcagatg agcaacgatt tctccaatga 1500
tgatggtgtt gatgaaggaa tctgtcttga aaccaatagt ggaactgaaa agatctcaaa 1560
atctggactt gaaaagaatt ccttgatcta tgaacttttc tctgttatgg ttcattctgg 1620
gagcgctgct ggtggtcatt attatgcatg tataaagtca ttcagtgatg agcagtggta 1680
cagcttcaat gatcaacatg tcagcaggat aacacaagag gacattaaga aaacacatgg 1740
tggatcttca ggaagcagag gatattattc tagtgctttc gcaagttcca caaatgcata 1800
tatgctgatc tatagactga aggatccagc cagaaatgca aaatttctag aagtggatga 1860
atacccagaa catattaaaa acttggtgca gaaagagaga gagttggaag aacaagaaaa 1920
gagacaacga gaaattgagc gcaatacatg caagataaaa ttattctgtt tgcatcctac 1980
aaaacaagta atgatggaaa ataaattgga ggttcataag gataagacat taaaggaagc 2040
agtagaaatg gcttataaga tgatggattt agaagaggta atacccctgg attgctgtcg 2100
ccttgttaaa tatgatgagt ttcatgatta tctagaacgg tcatatgaag gagaagaaga 2160
tacaccaatg gggcttctac taggtggcgt caagtcaaca tatatgtttg atctgctgtt 2220
ggagacgaga aagcctgatc aggttttcca atcttataaa cctggagaag tgatggtgaa 2280
agttcatgtt gttgatctaa aggcagaatc tgtagctgct cctataactg ttcgtgctta 2340
cttaaatcag acagttacag aattcaaaca actgatttca aaggccatcc atttacctgc 2400
tgaaacaatg agaatagtgc tggaacgctg ctacaatgat ttgcgtcttc tcagtgtctc 2460
cagtaaaacc ctgaaagctg aaggattttt tagaagtaac aaggtgtttg ttgaaagctc 2520
cgagactttg gattaccaga tggcctttgc agactctcat ttatggaaac tcctggatcg 2580
gcatgcaaat acaatcagat tatttgtttt gctacctgaa caatccccag tatcttattc 2640
caaaaggaca gcataccaga aagctggagg cgattctggt aatgtggatg atgactgtga 2700
aagagtcaaa ggacctgtag gaagcctaaa gtctgtggaa gctattctag aagaaagcac 2760
tgaaaaactc aaaagcttgt cactgcagca acagcaggat ggagataatg gggacagcag 2820
caaaagtact gagacaagtg actttgaaaa catcgaatca cctctcaatg agagggactc 2880
ttcagcatca gtggataata gagaacttga acagcatatt cagacttctg atccagaaaa 2940
ttttcagtct gaagaacgat cagactcaga tgtgaataat gacaggagta caagttcagt 3000
ggacagtgat attcttagct ccagtcatag cagtgatact ttgtgcaatg cagacaatgc 3060
tcagatccct ttggctaatg gacttgactc tcacagtatc acaagtagta gaagaacgaa 3120
agcaaatgaa gggaaaaaag aaacatggga tacagcagaa gaagactctg gaactgatag 3180
tgaatatgat gagagtggca agagtagggg agaaatgcag tacatgtatt tcaaagctga 3240
accttatgct gcagatgaag gttctgggga aggacataaa tggttgatgg tgcatgttga 3300
taaaagaatt actctggcag ctttcaaaca acatttagag ccctttgttg gagttttgtc 3360
ctctcacttc aaggtctttc gagtgtatgc cagcaatcaa gagtttgaga gcgtccggct 3420
gaatgagaca ctttcatcat tttctgatga caataagatt acaattagac tggggagagc 3480
acttaaaaaa ggagaataca gagttaaagt ataccagctt ttggtcaatg aacaagagcc 3540
atgcaagttt ctgctagatg ctgtgtttgc taaaggaatg actgtacggc aatcaaaaga 3600
ggaattaatt cctcagctca gggagcaatg tggtttagag ctcagtattg acaggtttcg 3660
tctaaggaaa aaaacatgga agaatcctgg cactgtcttt ttggattatc atatttatga 3720
agaagatatt aatatttcca gcaactggga ggttttcctt gaagttcttg atggggtaga 3780
gaagatgaag tccatgtcac agcttgcagt tttgtcaaga cggtggaagc cttcagagat 3840
gaagttggat cccttccagg aggttgtatt ggaaagcagt agtgtggacg aattgcgaga 3900
gaagcttagt gaaatcagtg ggattccttt ggatgatatt gaatttgcta agggtagagg 3960
aacatttccc tgtgatattt ctgtccttga tattcatcaa gatttagact ggaatcctaa 4020
agtttctacc ctgaatgtct ggcctcttta tatctgtgat gatggtgcgg tcatatttta 4080
tagggataaa acagaagaat taatggagat tgacagatga ctaaagctat g 4131

<210> 61 

<211> 3249 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7506357CB1 



<400> 61 

aggcgcgcca gaaagaaggg agaccagcag cgtcccttat tcgcttggcc ttggttcctg 60
tttgcactgg ctacagcagg gcactggccc ctactgtcac cgccacctac acaaagaccc 120
tatctctgag cgctgcagcc tactgttcag ccccaggttt gaggatggat gccctggacg 180
cttcgaagct actggatgag gagctgtatt caagacagct gtatgtgctg ggctcacctg 240
ccatgcagag gattcaggga gccagggtcc tggtgtcagg cctgcagggc ctgggggccg 300
aggtggccaa gaacttggtt ctgatgggtg tgggcagcct cactctgcat gatccccacc 360
ccacctgctg gtccgacctg gctgcccagt ttctcctctc agagcaggac ttggaaagga 420
gcagagccga ggcctctcaa gagctcttgg ctcagctcaa cagagctgtc caggtcgtcg 480
tgcacacggg tgacatcact gaggacctgc tgttggactt ccaggtggtg gtgctgactg 540
ctgcaaagct ggaggagcag ctgaaggtgg gcaccttgtg tcataagcat ggagtttgct 600
ttctggcggc tgacacccgg ggcctcgtgg ggcagttgtt ctgtgacttt ggtgaggact 660
tcactgtgca ggaccccaca gaggcagaac ccctgacagc tgccatccag cacatctccc 720
agggctcccc tggcattctc actctgagga aaggggccaa tacccactac ttccgtgatg 780
gagacttggt gactttctcg ggaattgagg gaatggttga gctcaacgac tgtgatcccc 840
ggtctatcca cgtgcgggag gatgggtccc tggagattgg agacacaaca actttctctc 900
ggtacttgcg tggtggggct atcactgaag tcaagagacc caagactgtg agacataagt 960
ccctggacac agccctgctc cagccccatg tggtggccca gagctcccag gaagttcacc 1020
atgcccactg cctgcatcag gccttctgtg cactgcacaa gttccagcac ctccatggcc 1080
ggccacccca gccctgggat cctgttgatg cagagactgt ggtgggcctg gcccgggacc 1140
tggaaccact gaagcggaca gaggaagagc cactggaaga gccactggat gaggccctag 1200
tgcggacagt cgccctaagc agtgcaggtg tcttgagccc tatggtggcc atgctgggtg 1260
cagtagctgc ccaggaagtg ctgaaggcaa tctccaggaa gttcatgcct ctggaccagt 1320
ggctttactt tgatgccctc gattgtcttc cggaagatgg ggagctcctt cccagtcctg 1380
aggactgtgc cctgagaggc agccgctatg atgggcaaat tgcagtgttt ggggctggtt 1440
ttcaggagaa actgagacgc cagcactacc tcctggtggg cgctggtgcc attggttgtg 1500
agctgctcaa agtctttgcc ctagtgggac tgggggccgg gaacagcggg ggcttgactg 1560
ttgttgacat ggaccacata gagcgctcca atctcagccg tcagttcctc ttcaggtccc 1620
aggacgttgg tagacccaag gcagaggtgg ctgcagcagc tgcccggggc ctgaacccag 1680
acttacaggt gatcccgctc acctacccac tggatcccac cacagagcac atctatgggg 1740
ataacttttt ctcccgtgtg gatggtgtgg ctgctgccct ggacagtttc caggcccggc 1800
gctatgtggc tgctcgttgc acccactatc tgaagccact gctggaggca ggcacatcgg 1860
gcacctgggg cagtgctaca gtattcatgc cacatgtgac tgaggcctac agagcccctg 1920
cctcagctgc agcttctgag gatgccccct accctgtctg taccgtgcgg tacttcccta 1980
gcacagccga gcacaccctg cagtgggccc ggcatgagtt tgaagaactc ttccgactgt 2040
ctgcagagac catcaaccac caccaacagg cacacacttc cctggcagac atggatgagc 2100
cacagacact caccttactg aagccagtgc ttggggtcct gagagtgcgt ccacagaact 2160
ggcaagactg tgtggcgtgg gctcttggcc actggaaact ctgctttcat tatggcatca 2220
aacagctgct gaggcacttc ccacctaata aagacacaca cctcctctac gtactggcag 2280
ctgccaacct gtatgcccag atgcatgggc tgcctggctc acaggactgg actgcactca 2340
gggagctgct gaagctgctg ccacagcctg acccccaaca gatggccccc atctttgcta 2400
gtaatctaga gctggcttcg gcttctgctg agtttggccc tgagcagcag aaggaactga 2460
acaaagccct ggaagtctgg agtgtgggcc ctcccctgaa gcctctgatg tttgagaagg 2520
atgatgacag caacttccat gtggactttg tggtagcggc agctagcctg agatgtcaga 2580
actacgggat tccaccggtc aaccgtgccc agagcaagcg aattgtgggc cagattatcc 2640
cagccattgc caccactaca gcagctgtgg caggcctgtt gggcctggag ctgtataagg 2700
tggtgagtgg gccacggcct cgtagtgcct ttcgccacag ctacctacat ctggctgaaa 2760
actacctcat ccgctatatg ccttttgccc cagccatcca gacgttccat cacctgaagt 2820
ggacctcttg ggaccgtctg aaggtaccag ctgggcagcc tgagaggacc ctggagtcgc 2880
tgctggctca tcttcaggag cagcacgggt tgagggtgag gatcctgctg cacggctcag 2940
ccctgctcta tgcggccgga tggtcacctg aaaagcaggc ccagcacctg cccctcaggg 3000
tgacagaact ggttcagcag ctgacaggcc aggcacctgc tcctgggcag cgggtgttgg 3060
tgctagagct gagctgtgag ggtgacgacg aggacactgc cttcccacct ctgcactatg 3120
agctgtgaca aggcagccac cctgtcacct agctcaatgg agccccggat cccaagccct 3180
gcattgtaag cccacagtag gcactcaata aatgcttgtt aaaggaaaaa aaaaaaaaaa 3240
aaaaaaaaa 3249



<210> 62 

<211> 986 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 6878857CB1 



<400> 62 

ggagatagat gagcgtacgt aagctcggaa tcggtcgagc atgaatctac ttctgatcct 60 
tacctttgtt gcagctgctg ttgctgcccc ctttgatgat gatgacaaga tcgttggggg 120 
ctacatctgt gaggagaatt ctgtccccta ccaggtgtcc ttgaattctg gctacaagtc 180 
ccgcatccag gtgagactgg gagagcacaa catcgaagtc ctggagggga atgaacagtt 240 
catcaatgca gccaagatca tccgccaccc caaatacaac agccggactc tggacaatga 300 
catcctgctg atcaagctct cctcacctgc cgtcatcaat tcccgcgtgt ccgccatctc 360 
tctgcccact gcccctccag ctgctggcac cgagtccctc atctccggct ggggcaacac 420 
tctgagttct ggtgccgact acccagacga gctgcagtgc ctggatgctc ctgtgctgag 480 
ccaggctgag tgtgaagcct cctaccctgg aaagattacc aacaacatgt tctgtgtggg 540 
cttcctcgag ggaggcaagg attcctgcca gggtgattct ggtggccctg tggtctccaa 600
tggagagctc caaggaattg tctcctgggg ctatggctgt gcccagaaga acaggcctgg 660
agtctacacc aaggtctaca actatgtgga ctggattaag gacaccatag ctgccaacag 720 
ctaaagcccc tggtccctct gcagtctcta taccaataaa gtgaccctgc tctcacaaaa 780
aaaaaaaaaa ggcgccgcca ataattaatc tcttcaaccc gaaataattc ccgaaccgtt 840 
accttcgcct tgcttggaat ttcattttag ttatcgttac cgtccactcg agggggcccc 900 
gtacccatcg cctatattat gtattaccgt aattggcgtg tttaacttct gttgaactgg 960 
ttacattatc ctgtatcttt caaaaa 986 



<210> 63 

<211> 3665 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7506021CB1 



<400> 63 

tactgccttc aagagatttt tctctctgca gtattcatgg tcagaattct ttgctgatga 60 

caacagcttc tctgcttgga gtctcaggag agtctgtgag gaggaaatga agtgtttagc 120 

gtccgttctg gtcactttag aggacaaagt actaacattt tgagtagcca cattgaactt 180 

ggtcttagca cataaggggt tcatccttga tccacgtttt ctggatttga agagctttct 240 

ttctaccctg gaaatgtctc aactgtctat tccaagtaac tggccttccc agcttagctt 300

tcagaatgca caagtgttgg tgtaaatata acttctttaa gcctccaaac ccagtattcc 360 

atttctcaga aaaggctaag tggattcctt tcctccatag aattttcctc tgttttttca 420

tttttctgat gcataatgag atctggatgc atgctgctac cctcgttcct ggcagctgcc 480 

ccgccgctgg ccggcgtcgc ggcagccgct tcagaggaag ctcggcggct gtggccgcgg 540 

cgaacaggcc ggacgcctcg tcgctggcgg ggcttccctt ggagctgacg aaggcggggt 600 

ccgcggtcca ggctgctgcc gcgacggggc cggcggcggg gcagctgcca ggaacgaggg 660 

tagcagctgc atccagatct cattatgcat cagaaaaatg aaaaaacaga ggaaaattct 720 

atggaggaaa ggaatccact tagccttttc tgagaaatgg aatactgggt ttggaggctt 780 

taagaagttt tattttcacc aacacttgtg cattctgaaa gctaagctgg gaaggccagt 840 
tacttggaat agacagttga gacatttcca gggtagaaag aaagctcttc aaatccagaa 900
aacgtggatc aaggatgaac ccctttgtgc taagaccaag ttcaatgtgg ctactcaaaa 960 
tgttagtact ttgtcctcta aagtgaaaag aaaggacgct aaacacttca tttcctcctc 1020 
aaagactctc ctgagactcc aagcagagaa gctgttgtca tcagcaaaga attctgacca 1080 
tgaatactgc agagagaaaa atctcttgaa ggcagttact gactttccat caaatagtgc 1140 
tttaggtcag gccaatggtc acagacctag gacagaccca caaccttctg actttcccat 1200 
gaagttcaat ggggagagcc aaagtccagt tgagagtggc acgattgtgg tcaccttgaa 1260 
caaccataag agaaagggct tttgttaccg ctgctgccaa gggccggagc accacaggaa 1320 
tgggggaccc ttgattccaa aaaagttcca acttaaccaa catagaagga taaaattatc 1380 
tcctcttatg atgtatgaga aattatccat gattagattt cggtacagga ttctcagatc 1440 
ccagcacttc agaaccaaaa gcaaggtttg caagctaaga aaagcccagc gaagctgggt 1500 
acagaaagtc actggggacc atcaagagac ccgtagggag aacggtgagg gtggcagttg 1560 
cagcccattt ccttccccag aacctaaaga cccttcttgt cggcatcagc cgtactttcc 1620 
agatatggac agcagtgctg tggtgaaggg gacgaactct catgtgcctg attgccacac 1680 
taaaggaagc tctttcttgg gcaaggagct tagtttagat gaagcattcc ctgaccaaca 1740 
gaatggcagt gccacaaacg cctgggacca gtcatcctgt tcttctccta agtgggagtg 1800 
tacagagctg attcatgaca tccccttacc agaacatcgt tctaatacca tgttcatttc 1860 
agaaactgaa agagaaatta tgactctggg tcaggaaaat cagacaagtt ctgtcagtga 1920 
tgacagagta aaactgtcag tgtctggagc agatacatct gtgagtagcg tagatgggcc 1980
tgtgtcccaa aaggctgttc aaaatgagaa ctcataccag atggaggagg atggatctct 2040 
caagcagagc attcttagtt ctgagttgct ggaccaccct tactgtaaaa gtccactgga 2100 
ggctcccttg gtgtgcagtg gactcaaact agaaaatcaa gtaggaggtg gaaagaacag 2160 
tcagaaagcc tctccagtgg atgatgaaca gctgtcagtc tgtctttctg gattcctaga 2220 
tgaggttatg aagaagtatg gcagtttggt tccactcagt gaaaaagaag tccttggaag 2280 
attaaaagat gtctttaatg aagacttttc taatagaaaa ccatttatca atagggaaat 2340 
aacaaactat cgggccagac atcaaaaatg taacttccgt atcttctata ataaacacat 2400 
gctggatatg gacgacctgg cgactctgga tggtcagaac tggctgaatg accaggtcat 2460 
taatatgtat ggtgagctga taatggatgc agtcccagac aaagttcact tcttcaacag 2520 
cttttttcat agacagctgg taaccaaagg atataatgga gtaaaaagat ggactaaaaa 2580 
ggtggatttg tttaaaaaga gtcttctgtt gattcctatt cacctggaag tccactggtc 2640 
tctcattact gtgacactct ctaatcgaat tatttcattt tatgattccc aaggcattca 2700 
ttttaagttt tgtgtagagt gtattccaca acagaaaaac gacagtgact gtggagtctt 2760 
tgtgctccag tactgcaagt gcctcgcctt agagcagcct ttccagtttt cacaagaaga 2820 
catgccccga gtgcggaaga ggatttacaa ggagctatgt gagtgccggc tcatggactg 2880 
aaactcagca gggactctgg gaagtctgac caagttggag cagatggttt gttacttgaa 2940 
tctccaaaca cttagttgaa tttttacaga tatttcagat cagtggtgtt gggccactat 3000
tgttacctca aatttatttt ttgcccttat tcatttctcc agctaccatg tactattgtt 3060
taatgttcag tttggtttca tttttaattt tatggttctg tgcgtccccc atatttaata 3120 
tttattattc aaacgcatgc atatagacag agcatgcagt gaagagtatt aaaaaaaaaa 3180 
gcttagtaga tttggtgcag cttttgaaac ttagttagac gtgaactgaa tacaggtttc 3240 
aaatttactc ccagaaccta aaaatgcaag atgtttttga tacaacataa ctctgagaat 3300 
agtaagtgtt ccctggggca ttaagggtag ctgggggtgg ttttgacaaa tccagtcctg 3360 
ttttacttta ccagcggcaa ctttcaccaa cttcccctct ccaagtgagt cttagagagt 3420 
gcagtccatt ccttttgaag ggtgagatgg aagtggtcgt aaactgactg gtgtcttctg 3480 
tttctggagg cacacttgta agcacagtgg ctgctttggg aggagtaagg tgtgagaaaa 3540 
agcaaccttg gaggccagta acaatgacag atttcaatcg tggttttagg aattataata 3600
cgtggcatac atctcataaa ggcttttgct gggaaaaaaa aaaaaaaaaa aaaaaaaaga 3660
tcttt 3665



<210> 64 
<211> 1440 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7503356CB1 

<400> 64 

cggacgcgtg ggcggacgcg tgggtgccca ggcgcttaaa gaagcaaaat ctcttgtgca 60
ggagcagcag agactcctca ggaagactca ctggactgta cccaccacct gccatgtctc 120
tgtggccacc tttccgatgc agatggaagc tggcgccaag gtactctagg agggcgtctc 180
cacagcaacc ccaacaggac tttgaggccc tgctggcaga gtgcctgagg aatggctgcc 240
tctttgaaga caccagcttc ccggccaccc tgagctccat cggcagtggc tccctgctgc 300
agaagctgcc accccgcctg cagtggaaga ggcccccgga gctgcacagc aatccccagt 360
tttattttgc caaggccaaa aggctggatc tgtgccaggg gatagtagga gactgctggt 420
tcttggctgc tttgcaagct ctggccttgc accaggacat cctgagccgg gttgttcccc 480
tgaatcagag tttcactgag aagtatgctg gcatcttccg gttctgggct ctctggttcc 540
tatgaagact tgcagtcagg acaggtgtct gaagcccttg tagacttcac tggaggggtg 600
acaatgacca tcaacctggc agaagcccat ggcaacctct gggacatcct catcgaagcc 660
acctacaaca gaaccctcat tggctgccag acccactcag gggagaagat tctggagaat 720
gggctggtgg aaggccatgc ctatactctc acaggaatca ggaaggtgac ctgcaaacat 780
agacctgaat atctcgtcaa gctacggaac ccctggggaa aggtggaatg gaaaggagac 840
tggagtgaca gttcaagtaa atgggagctg ctgagcccca aggagaagat tctgcttctg 900
aggaaagaca atgacggaga attctggatg acgctgcagg actttaaaac acatttcgtg 960
ctcctggtta tctgtaaact gaccccaggc ctgttgagcc aggaggcggc ccagaagtgg 1020
acgtacacca tgcgggaggg gagatgggag aagcggagca cagctggtgg ccagaggcag 1080
ttgctgcagg acacattttg gaagaacccg cagttcctgc tgtctgtctg gaggcccgag 1140
gagggcagga gatccctgag gccctgcagc gtgctggtgt ccctgctcca gaagcccagg 1200
cacaggtgcc gcaagcggaa gcctctcctc gccattggct tctacctcta taggatgaac 1260
aagtaccatg atgaccagag gagactgccc cctgagttct tccagagaaa cactcctctg 1320
agccagcctg ataggtttct caaggagaaa gaagtgagtc aggagctgtg tctggaacca 1380
gggacgtacc tcatcgtgcc tgcatattgg aggcccacca gaagtcagag ttcgtcctca 1440



<210> 65 

<211> 1064 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7509052CB1



<400> 65 

gacgccaccg ccgaggaaaa ccgtgtacta ttagccatgg tcaaccccac cgtgttcttc 60
gacattgccg tcgacggcga gcccttgggc cgcgtctcct ttgaggtaag gggcctggat 120
accaagaagt gactgctcat ctaatccata aagctatgtt aacagattgg aggtagtagc 180
attttcatta caagtgacta aaagaacagc tgtttacccc tgatcgtgca gcagtgcttg 240
ctgttcctta gaattttgcc ttctgtttgc agacaaggtc ccaaagacag cagaaaattt 300
tcgtgctctg agcactggag agaaaggatt tggttataag ggttcctgct ttcacagaat 360
tattccaggg tttatgtgtc agggtggtga cttcacacgc cataatggca ctggtggcaa 420
gtccatctat ggggagaaat ttgaagatga gaacttcatc ctaaagcata cgggtcctgg 480
catcttgtcc atggcaaatg ctggacccaa cacaaatggt tcccagtttt tcatctgcac 540
tgccaagact gagtggttgg atggcaagca tgtggtgttt ggcaaagtga aagaaggcat 600
gaatattgtg gaggccatgg agcgctttgg gtccaggaat ggcaagacca gcaagaagat 660
caccattgct gactgtggac aactcgaata agtttgactt gtgttttatc ttaaccacca 720
gatcattcct tctgtagctc aggagagcac ccctccaccc catttgctcg cagtatccta 780
gaatctttgt gctctcgctg cagttccctt tgggttccat gttttccttg ttccctccca 840
tgcctagctg gattgcagag ttaagtttat gattatgaaa taaaaactaa ataacaaaga 900
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 960
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaag 1020
aaaaaaaaaa aaaaaaaggg ggcccccttt agggggcccc cctt 1064



<210> 66 

<211> 4497 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7503366CB1 



<400> 66 

cgcggctggg cgcctcggcc atgactgcgg agctgcagca ggacgacgcg gccggcgcgg 60
cagacggcca cggctcgagc tgccaaatgc tgttaaatca actgagagaa atcacaggca 120
ttcaggaccc ttcctttctc catgaagctc tgaaggccag taatggtgac attactcagg 180
cagtcagcct tctcactgat gagagagtta aggagcccag tcaagacact gttgctacag 240
aaccatctga agtagagggg agtgctgcca acaaggaagt attagcaaaa gttatagacc 300
ttactcatga taacaaagat gatcttcagg ctgccattgc tttgagtcta ctggagtctc 360
ccaaaattca agctgatgga agagatctta acaggatgca tgaagcaacc tctgcagaaa 420
ctaaacgctc aaagagaaaa cgctgtgaag tctggggaga aaaccccaat cccaatgact 480
ggaggagagt tgatggttgg ccagttgggc tgaaaaatgt tggcaataca tgttggttta 540
gtgctgttat tcagtctctc tttcaattgc ctgaatttcg aagacttgtt ctcagttata 600
gtctgccaca aaatgtactt gaaaattgtc gaagtcatac agaaaagaga aatatcatgt 660
ttatgcaaga gcttcagtat ttgtttgctc taatgatggg atcaaataga aaatttgtag 720
acccgtctgc agccctggat ctattaaagg gagcattccg atcatctgag gaacagcagc 780
aagatgtgag tgaattcaca cacaagctcc tggattggct agaggacgca ttccagctag 840
ctgttaatgt taacagcagt cccaggaaca aatctgaaaa tccaatggtg cagctgttct 900
atggtacttt cctgactgaa ggggttcgtg aaggaaaacc cttttgtaac aatgagacct 960
tcggccagta tcctcttcag gtaaacggtt atcgcaactt agacgagtgt ttggaagggg 1020
ccatggtgga gggtgatgtt gagcttcttc cctccgatca gtcggtgaag tatggacaag 1080
agcgttggtt tacaaagcta cctccagtgt tgacctttga actctcaaga tttgagttta 1140
atcagtccct tgggcagcca gagaaaattc acaataagct ggaatttcct cagattattt 1200
atatggacag gtacatgtac aggagcaagg agcttattcg aaataagaga gagtgtattc 1260
gaaagttgaa ggaggaaata aaaattctgc agcaaaaatt ggaaaggtat gtgaaatatg 1320
gctcaggccc agctcggttc ccgctcccgg acatgctgaa atatgttatt gaatttgcta 1380
gtacaaaacc tgcctcagaa agctgtccac ctgaaagtga cacacatatg acattaccac 1440
tttcttcagt gcactgctcg gtttctgacc agacatccaa ggaaagtaca agtacagaaa 1500
gctcttctca ggatgttgaa agtacctttt cttctcctga agattcttta cccaagtcta 1560
aaccactgac atcttctcgg tcttccatgg aaatgccttc acagccagct ccacgaacag 1620
tcacagatga ggagataaat tttgttaaga cctgtcttca gagatggagg agtgagattg 1680
aacaagatat acaagattta aagacttgta ttgcaagtac tactcagact attgaacaga 1740
tgtactgcga tcctctcctt cgtcagttct tgttcatgaa ggacaagcaa atgctggaca 1800
ctattgggcc tatatctata atcaaccccg acagagctgg ctcaagtaca atgacatctc 1860
tgttactgaa tcttcctggg aagaagttga aagagattcc tatggaggcc tgagaaatgt 1920
tagtgcttac tgtctgatgt acattaatga caaactaccc tacttcaatg cagaggcagc 1980
cccaactgaa tcagatcaaa tgtcagaagt ggaagcccta tctgtggaac tcaagcatta 2040
cattcaggag gataactggc ggtttgagca ggaagtagag gagtgggaag aagagcagtc 2100
ttgcaaaatc cctcaaatgg agtcctccac caactcctca tcacaggact actctacatc 2160
acaagagcct tcagtagcct cttctcatgg ggttcgctgc ttgtcatctg agcatgctgt 2220
gattgtaaag gagcaaactg cccaggctat tgcaaacaca gcccgtgcct atgagaagag 2280
cggtgtagaa gcggcactga gtgaggcatt ccatgaagaa tactccaggc tctatcagct 2340
tgccaaagag acccccacct ctcacagtga tcctcgactt cagcatgtcc ttgtctactt 2400
tttccaaaat gaagcaccca aaagggtagt agaacgaacc cttctggaac agtttgcaga 2460
taaaaatctt agctatgatg aaagatcaat cagcattatg aaggtggctc aagcgaaact 2520
gaaggaaatt ggtccagatg acatgaatat ggaagagtac aagaagtggc atgaagatta 2580
tagtttgttc cgaaaagtgt ctgtgtatct cctaacaggc ctagaactct atcaaaaagg 2640
aaagtgccaa gaggcacttt cctacctggt atatgcctac cagagcaatg ctgccctgct 2700
gatgaagggg ccccgccggg gggtcaaaga atccgtgatt gctttatacc gaagaaaatg 2760
gcttctggag ctgaatgcca aagcagcttc tctttttgaa acaaatgatg atcactccgt 2820
aactgagggc attaatgtga tgaatgaact gatcatcccc tgcattcacc ttatcattaa 2880
taatgacatt tccaaggatg atctggatgc cattgaggtc atgagaaacc attggtgctc 2940
ttaccttggg caagatattg cagaaaatct gcagctgtgc ctaggggagt ttctacccag 3000
acttctagat ccttctgcag aaatcatcgt cttgaaagag cctccaacta ttcgacccaa 3060
ttctccctat gacctatgta gccgatttgc agctgtcatg gagtcaattc agggagtttc 3120
aactgtgaca gtgaaataag ctcccacatg ttcaaggccc attctggttc ctggctgcct 3180
gcctcttgca cagaagttcg ttgtcatagt gctcaccttg ggaaaaggat taggtgggca 3240
cataagattc cgatcagacc ccaaccatgc tgcatgtgta aagaaggatt gaaaataaaa 3300
ttgcactttt taggtacaaa atcataaaag gtgtttcact agaaaaggca gaaagcagtg 3360
tattaaggtg ttgaattacg ccagaagacc tgaaatgcct tgtacctaca acaatgctta 3420
ggcttttcta agcctcttgc cacttttaaa attatccttc aggcataaat atttttgaca 3480
gcagaataga agaatgattc atgagaacct gaaccagatg aacagctact agttatttta 3540
tcaaatacag atgacattta aaaattctta actacaagag attagaaata taaaccttgc 3600
ctggctcttg ccaggagata acaaaatggg ttgctgatga actgcaccct tttacatgtg 3660
ggtagaatat aagctcacat ggcagtgaga tgttgaaaag tcaaaagaga cctgtctctc 3720
tcctttcttt tctatcttta aaccagaaaa cctcatactc agtcctcagt gaaagaaagt 3780
aaagtattaa ggactttaga cagaagagca ttgtgtaact tgactgaaga tcatccatta 3840
atagttatta ggcatttagg taaaattttc taatacctaa aaattgtcaa aaacagtcaa 3900
tagggctact gctggcccaa agaccattta ggtccacctc ctcttttttg ctcttttttt 3960






ttttctgtga cagtttcact gtgtcgccca ggctggcgtt cagtggtgca atctcagctc 4020
actgcaaact ctgtctcctg ggctcaagtg attctcgtgc ctcagcctcc cgaatagctg 4080
gaattacggg catgcaccac cacacctggc taatttttgt atttttaata gagatggggt 4140
ttcaccatat tggccaggct gatctctaac tcctggcctc aagtgatcta tctgcctccc 4200
tcagcctccc aaagtctggg attgcagaca agtcatcgta cccggccttc ttttttgccc 4260
ttaaaagtaa gggatgtggg tttgtacaaa aaaaaaaaaa aaaaaaaaag aggggcggcc 4320
gcgcgattat tgagtctctt gcaacccgcg aatttatttc cgaaccggtt acctgagggc 4380
gttcccagtt tcctaatggt gagtcgtttt acagcttgta gtaatcatga acaagctgtc 4440
ctgtgtgaat tgtttcgttc catccacata tcacacacac aacacggacg gaagacg 4497

<210> 67
<211> 1247
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7505933CB1

<400> 67

cggctcgagc cgtcccggag acccggtcgg gagggaggaa ggtggcaaga tggtgttgga 60
aagcactatg gtgtgtgtgg acaacagtga gtatatgcgg aatggagact tcttacccac 120
caggctgcag gcccagcagg atgctgtcaa catagtttgt cattcaaaga cccgcagcaa 180
ccctgagaac aacgtgggcc ttatcacact ggctaatgac tgtgaagtgc tgaccacact 240
caccccagac actggccgta tcctgtccaa gctacatact gtccaaccca agggcaagat 300
caccttctgc acgggcatcc gcgtggccca tctggctctg aagcaccgac aaggcaagaa 360
tcacaagatg cgcatcattg cctttgtggg aagcccagtg gaggacaatg agaaggatga 420
ggtgaacaca gaaaagctga cagcctttgt aaacacgttg aatggcaaag atggaaccgg 480
ttctcatctg gtgacagtgc ctcctgggcc cagtttggct gatgctctca tcagttctcc 540
gattttggct ggtgaaggtg gtgccatgct gggtcttggt gccagtgact ttgaatttgg 600
agtagatccc agtgctgatc ctgagctggc cttggccctt cgtgtatcta tggaagagca 660
gcggcagcgg caggaggagg aggcccggcg ggcagctgca gcttctgctg ctgaggccgg 720
gattgctacg actgggactg aagactcaga cgatgccctg ctgaagatga ccatcagcca 780
gcaagagttt ggccgcactg ggcttcctga cctaagcagt atgactgagg aagagcagat 840
tgcttatgcc atgcagatgt ccctgcaggg agcagagttt ggccaggcgg aatcagcaga 900
cattgatgcc agctcagcta tggacacatc cgagccagcc aaggaggagg atgattacga 960
cgtgatgcag gaccccgagt tccttcagag tgtcctagag aacctcccag gtgtggatcc 1020
caacaatgaa gccattcgaa atgctatggg ctccctggcc tcccaggcca ccaaggacgg 1080
caagaaggac aagaaggagg aagacaagaa gtgagactgg agggaaaggg tagctgagtc 1140
tgcttagggg actgcatggg aagcacggaa tatagggtta gatgtgtgtt atctgtaacc 1200
attacagcct aaataaagct tggcaacttt ttttcaaaaa aaaaaaa 1247

<210> 68
<211> 714
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7507064CB1

<400> 68

atgaggggcc ttgtggtatt ccttgcagtc tttgctctct ctgaggtcaa tgccatcacc 60
aggtctccaa cattgtggac ccccaccaga ctgtgggtct gagcacccag gaacctggcg 120
acgtcttcac ctactccgag tttgatggga tcctggggct ggcctatccc tctcttgcct 180
ctgagtagtc agtgccagtg tttgacaaca cgatgcagag gcacctggtg gcccaagacc 240
tgttctcagt ctacatgagt aggaatgacc aggggagcat gctcacgctg agggccattg 300
atctgtcgta ctacacaggc tccctgcact ggatacccat gactgcaaga atactggcag 360
ttcactgtgg acagtttgac atcgactgcg ggcgcctgag cagcattccc acggctgtct 420
tcgagatcca cggcaagaag taccccctgc caccctccgc ctataccagc caggaccagg 480
gcttctgcac cagtggtttc cagggtgact atagttccca gcagtggatc ctggggaatg 540
tcttcatctg ggagtattac agtgtctttg acaggaccaa taaccgtgtg gggctggcga 600
aggctgtctg attgcatcac tggccatgga cctcaatgtg accaaacaca cacacgcaca 660
tagatgagat gtgcaggcag atggttccca ataaacaccg catttctcca aaaa 714



<210> 69
<211> 1008
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 1439986CB1

<400> 69

agctagccag ccatgggcca ggattattac tctgtgctcg ggatcactcg caattcagag 60
gatgcccaga tcaagcaggc gtaccgcaga ctcgccctta agcaccaccc gttgaagtca 120
aatgagccgt cttcagcaga gattttcagg caaatagcag aggcctacga cgtgctgagt 180
gaccccatga agagaggcat ctacgacaag tttggagaag agggcctgaa gggtgggatt 240
cctttggagt ttggatccca gaccccatgg acaactggtt acgtcttcca cggcaaacct 300
gaaaaggtgt tccacgagtt ctttggtgga aacaacccct tcagtgagtt ttttgatgca 360
gaaggaagtg aggtagattt gaactttggg gggctccagg gccgaggggt caagaagcag 420
gacccccaag tcgaacggga tctctacctg tccctggagg acttattctt tggctgcacc 480
aaaaaaatta agatctccag aagggtgctg aacgaggatg ggtactcctc caccatcaag 540
gacaagatcc tgaccattga tgtgaagccc ggttggaggc agggcacacg catcaccttt 600
gagaaggaag gggaccaggg ccccaacatc atcccagcag acatcatttt catcgtaaag 660
gagaagctac accctcgctt ccgcagggag aatgacaacc tcttcttcgt gaaccccatc 720
cctcttggca aggctctcac ctgctgcact gtggaggtga ggaccctaga tgaccgtctg 780
ctcaacatcc ccatcaatga catcatccac cccaaatact tcaagaaggt gccaggggag 840
gggatgccat tgccggagga ccccactaag aaaggggatc tcttcatctt cttcgacatc 900
cagttcccca cccgcctcac accccagaag aagcagatgc tgcgccaggc attgctgaca 960
tgactgtggt gggctggagc aggggtgaga ggaggctagc cgggcctc 1008



<210> 70
<211> 2425
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 2008979CB1

<400> 70

ccgggccccg accccgtccc gggccccagc gccggccgcc cgcccggtcg ggcgatgaag 60
tgtcactatg aggcgctggg ggtgcggcgc gacgccagcg aggaggagct caagaaggcc 120
tatcggaagc tggccctgaa atggcacccg gataaaaatc tggataatgc cgcagaagca 180
gctgaacaat ttaaattaat ccaagcagca tatgatgtgt tgagtgaccc tcaggaaaga 240
gcatggtatg ataatcatag agaggcccta cttaaaggtg ggtttgatgg cgaatatcaa 300
gatgacagct tagatttgct acgctatttc accgttacct gttattctgg ttatggagat 360
gatgaaaagg gattttacac ggtgtatcgt aatgtttttg aaatgattgc caaggaagaa 420
ctagaatctg tgttagagga agaggttgat gatttcccaa cttttggaga ctcccagagt 480
gactatgata cggtagtcca tcctttctac gcttattggc agagtttctg cactcaaaag 540
aattttgcat ggaaggaaga atatgataca cgacaggctt caaaccgctg ggaaaaacga 600
gccatggaaa aagaaaacaa aaagattcgg gacaaagcaa ggaaagagaa gaatgagctt 660
gtccgtcagc tggtagcttt cattcgtaaa agagataaaa gagtgcaggc gcatcgaaaa 720
cttgtggaag aacagaatgc agagaaggcg aggaaagccg aagagatgag gcggcagcag 780
aagctaaagc aggccaaact ggtggagcag tacagagaac agagctggat gactatggcc 840
aatttggaga aagagctcca ggagatggag gcacggtacg agaaggagtt tggagatgga 900
tcggatgaaa atgaaatgga agaacatgaa ctcaaagatg aggaggatgg taaagacagt 960
gatgaggccg aggacgctga gctctatgat gacctttact gcccagcatg tgacaaatcg 1020
ttcaagacag aaaaggccat gaagaatcac gagaagtcaa agaagcatcg ggaaatggtg 1080
gccttgctaa aacaacagct ggaggaggaa gaagaaaatt tttcaagacc tcaaattgat 1140
gaaaatccat tagatgacaa ttctgaggaa gaaatggaag atgcaccaaa acaaaagctt 1200
tctaaaaaac agaagaaaaa gaaacagaaa ccagcacaga attatgatga caatttcaat 1260
gtaaatggac ctggagaagg agtaaaggtt gatccagaag atactaactt aaatcaagac 1320
agtgccaaag aattggaaga tagtccccag gaaaatgtca gtgtcacaga gatcattaaa 1380
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ccatgtgatg atccaaaaag tgaagctaaa agtgttccta aacccaaagg aaagaaaacc 1440 
aaagatatga aaaaacctgt cagagtacct gctgaaccac aaacaatgag tgttcttatc 1500 
agctgtacaa cctgccatag tgaatttcca tctcggaata aactttttga ccatctaaag 1560 
gccacaggtc atgcaagagc accttcatca tcgtctttaa acagcgcaac aagtagtcaa 1620 
agcaagaaag agaaacgtaa aaacagatag agattctgcc tgtgcttttg tttgactgtc 1680 
tctagatttt gaaaccaaaa aactgaactg aaatcatcta aagagttaaa atttcagtga 1740 
tctgcaatta attacattgt ggaagattat tttttatctt gtaaaaacac ttttttggtt 1800 
taatatatat ttttaaaaca tttcactagt gattgaattc tacttttgcc atctgaattg 1860 
acttgaatgt cttaaaacag gtaaatactg taaagtgtgt attcttgatg tttattggct 1920 
catgtggaca gaaatgtaca gggagaatta cattatttta acacacagaa gtgcaacttt 1980 
ctgctttatt ttctgaattt cacattactt ttacttaatg cttttgtgtt ttgttaatac 2040 
ttcataatat gtgaaaaact cggatctttt aaaaagcatc atagatcatt tttccatatg 2100 
acactggttc cgattttaaa aattattttt aaataaccga ttattgatta ctgtattttt 2160 
tttctcaaga acagtgatag gtagaaacta attgaacatt tggtagtctt tcaagaatag 2220 
tgtctcttca aggttttact tgatttaatt tgatatttta ctggtttacc agtaaggtgt 2280 
attgttcagt tttttgctcc gatttgaatt gtggaggtgg aagcaaatta gtttacatgg 2340 
catgtcctcc ctaggcacag tgacagctgt aaagtatgac ggaacaaggt agcagatggt 2400 
acagaattta tactatttaa gaaga 2425 

<210> 71 

<211> 856 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 90073157CB1 

<400> 71 

gaaccactcc ggcccgctag acctgctgct atgtcccggc cgcagcttcg acgctggcgc 60 
ctcgtctcta gcccgccgag cggcgtcccg ggtctagcgc tgctggcgct gctggcgctg 120 
ctggcgctgc ggctcgcggc cgggaccgac tgcccatgcc cggagcctga gctctgccgc 180 
ccgattcgcc accatccaga tttcgaggtc tttgtgtttg atgttggaca gaaaacttgg 240 
aaatcttatg attggtcaca gattacaact gtggcaacat ttggaaaata tgactcagaa 300 
cttatgtgct acgctcattc aaaaggagcc agagtagtac ttaaaggtaa cctttgatgt 360 
agcttggtct ccaaagaaca tagacagaag atgctataat tatactggaa tcgcagatgc 420 
ttgtgacttc ctctttgtga tgtcttatga tgaacaaagt cagatctggt cagaatgtat 480 
tgcagcagcc aatgctccct ataatcagac attaactgga tataatgact acatcaagat 540 
gagcattaat cctaagaaac ttgtaatggg tgttccttgg tatggttatg attatacctg 600 
cctgaatctg tctgaggatc atgtttgtac cattgcaagt atggtatgat aaccctcaga 660 
gtatttcttt aaaggcaaca tatatacaaa actatcgctt acggggcatt ggcatgtgga 720 
atgcaaactg tcttgactac tctggagatg ctgtagccaa acagcaaact gaagaaatgt 780 
gggaagtctt aaagccaaag ctgttacaga gatgaacatc ttttgtcaaa ccattaagag 840 
ttagaaagat gatctg 856 

<210> 72
<211> 1318
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7506782CB1

<400> 72

ggaggcttga aactgttaca atatggcttt cctgggactc ttctctttgc tggttctgca 60
aagtatggct acaggggcca ctttccctga ggaagccatt gctgacttgt cagtgaatat 120
gtataatcgt cttagagcca ctggtgaaga tgaaaatatt ctcttctctc cattgagtat 180
tgctcttgca atgggaatga tggaacttgg ggcccaagga tctacccaga aagaaatccg 240
ccactcaatg ggatatgaca gcctaaaaaa tggtgaagaa ttttctttct tgaaggagtt 300
ttcaaacatg gtaactgcta aagagagcca atatgtgatg aaaattgcca attccttgtt 360
tgtgcaaaat ggatttcatg tcaatgagga gtttttgcaa atgatgaaaa aatattttaa 420
tgcagcagta aatcatgtgg acttcagtca aaatgtagcc gtggccaact acatcaataa 480
gtgggtggag aataacacaa acagggaatt tagtgatggc tccaatgaag ctggtggtat 540
ctaccaagtc ctagaaatac catatgaagg agatgaaata agcatgatgc tggtgctgtc 600
cagacaggaa gttcctcttg ctactctgga gccattagtc aaagcacagc tggttgaaga 660
atgggcaaac tctgtgaaga agcaaaaagt agaagtatac ctgcccaggt tcacagtgga 720
acaggaaatt gatttaaaag atgttttgaa ggctcttgga ataactgaaa ttttcatcaa 780
agatgcaaat ttgacaggcc tctctgataa taaggagatt tttctttcca aagcaattca 840
caagtccttc ctagaggtta atgaagaagg ctcagaagct gctgctgtct caggaatgat 900
tgcaattagt aggatggctg tgctgtatcc tcaagttatt gtcgaccatc catttttctt 960
tcttatcaga aacaggagaa ctggtacaat tctattcatg ggacgagtca tgcatcctga 1020
aacaatgaac acaagtggac atgatttcga agaactttaa gttactttat ttgaataaca 1080
aggaaaacag taactaagca cattatgttt gcaactggta tatatttagg atttgtgttt 1140
tacagtatat cttaagataa tatttaaaat agttccagat aaaaacaata tatgtaaatt 1200
ataagtaact tgtcaaggaa tgttatcagt attaagctaa tggtcctgtt atgtcattgt 1260
gtttgtgtgc tgttgtttaa ataaaagtac ctattgaaaa aaaaaaaaaa aaaaaaaa 1318

<210> 73
<211> 2251
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7506941CB1

<400> 73

cctggagtca gcttaaaaag ctgcttgccg ataccagaaa atatcatggc tacatgatgg 60
ctaaggcacc acatgatttc atgtttgtga agaggaatga tccagatgga cctcattcag 120
acagaatcta ttaccttgcc atgtctggtg agaacagaga aaatacactg ttttattctg 180
aaattcccaa aactatcaat agagcagcag tcttaatgct ctcttggaag cctcttttgg 240
atctttttca ggcaacactg gactatggaa tgtattctcg agaagaagaa ctattaagag 300
aaagaaaacg cattggaaca gtcggaattg cttcttacga ttatcaccaa ggaagtggaa 360
catttctgtt tcaagccggt agtggaattt atcacgtaaa agatggaggg ccacaaggat 420
ttacgcaaca acctttaagg cccaatctag tggaaactag ttgtcccaac atacggatgg 480
atccaaaatt atgccctgct gatccagact ggattgcttt tatacatagc aacgatattt 540
ggatatctaa catcgtaacc agagaagaaa ggagactcac ttatgtgcac aatgagctag 600
ccaacatgga agaagatgcc agatcagctg gagtcgctac ctttgttctc caagaagaat 660
ttgatagata ttctggctat tggtggtgtc caaaagctga aacaactccc agtggtggta 720
aaattcttag aattctatat gaagaaaatg atgaatctga ggtggaaatt attcatgtta 780
catcccctat gttggaaaca aggcagggca gattcattcc gttatcctaa aacaggtaca 840
gcaaatccta aagtcacttt taagatgtca gaaataatga ttgatgctga aggaaggatc 900
catgacatct ttcatgtttt tccccaaagt cacgaagagg aaattgagtt tatttttgcc 960
tctgaatgca aaacaggttt ccgtcattta tacaaaatta catctatttt aaaggaaagc 1020
aaatataaac gatccagtgg tgggctgcct gctccaagtg atttcaagtg tcctatcaaa 1080
gaggagatag caattaccag tggtgaatgg gaagttcttg gccggcatgg atctaatatc 1140
caagttgatg aagtcagaag gctggtatat tttgaaggca ccaaagactc ccctttagag 1200
catcacctgt acgtagtcag ttacgtaaat cctggagagg tgacaaggct gactgaccgt 1260
ggctactcac attcttgctg catcagtcag cactgtgact tctttataag taagtatagt 1320
aaccagaaga atccacactg tgtgtccctt tacaagctat caagtcctga agatgaccca 1380
acttgcaaaa caaaggaatt ttgggccacc attttggatt cagcaggtcc tcttcctgac 1440
tatactcctc cagaaatttt ctcttttgaa agtactactg gatttacatt gtatgggatg 1500
ctctacaagc ctcatgatct acagcctgga aagaaatatc ctactgtgct gttcatatat 1560
ggtggtcctc aggtgcagtt ggtgaataat cggtttaaag gagtcaagta tttccgcttg 1620
aataccctag cctctctagg ttatgtggtt gtagtgatag acaacagggg atcctgtcac 1680
cgagggctta aatttgaagg cgcctttaaa tataaaatgg ttgctattgc tggggcccca 1740
gtcactctgt ggatcttcta tgatacagga tacacggaac gttatatggg tcaccctgac 1800
cagaatgaac agggctatta cttaggatct gtggccatgc aagcagaaaa gttcccctct 1860
gaaccaaatc gtttactgct cttacatggt ttcctggatg agaatgtcca ttttgcacat 1920
accagtatat tactgagttt tttagtgagg gctggaaagc catatgattt acagatctat 1980
cctcaggaga gacacagcat aagagttcct gaatcgggag aacattatga actgcatctt 2040
ttgcactacc ttcaagaaaa ccttggatca cgtattgctg ctctaaaagt gatataattt 2100
tgacctgtgt agaactctct ggtatacact ggctatttaa ccaaatgagg aggtttaatc 2160
aacagaaaac acagaattga tcatcacatt ttgatacctg ccatgtaaca tctactcctg 2220
aaaataaatg tggtgccatg aaaaaaaaaa a 2251

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<210> 74 
<211> 1147
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7507072CB1

<400> 74

atgcagggca cccctggagg cgggacgcgc cctgggccat cccccgtgga caggcggaca 60
ctcctggtct tcagctttat cctggcagca gctttgggcc aaatgaattt cacaggggac 120
caggttcttc gagtcctggc caaagatgag aagcagcttt cacttctcgg ggatctggag 180
ggcctgaaac cccagaaggt ggacttctgg cgtggcccag ccaggcccag cctccctgtg 240
gatatgagag ttcctttctc tgaactgaaa gacatcaaag cttatctgga gtctcatgga 300
cttgcttaca gcatcatgat aaaggacatc caggtgctgc tggatgagga aagacgggcc 360
atggcgaaat cccgccggct ggagcgcagc accaacagct tcagttactc atcataccac 420
accctggagg agatatatag ctggattgac aactttgtaa tggagcattc cgatattgtc 480
tcaaaaattc agattggcaa cagctttgaa aaccagtcca ttcttgtcct gaaggaaatg 540
gttctaacag caacccctgc tcagaaactt atcacgggcc ctcccctcag tcggagccgg 600
aggtggctgc catagtgaac ttcatcacag cccatggcaa cttcaaggct ctgatctcca 660
tccacagcta ctctcagatg cttatgtacc cttacggccg attgctggag cccgtttcaa 720
atcagaggga gttgtacgat cttgccaagg atgcggtgga ggccttgtat aaggtccatg 780
ggatcgagta catttttggc agcatcagca ccaccctcta tgtggccagt gggatcaccg 840
tcgactgggc ctacgacagt ggcatcaagt acgccttcag ctttgagctc cgggacactg 900
ggcagtatgg cttcctgctg ccggccacac agatcatccc cacggcccag gagacgtgga 960
tggcgcttcg gaccatcatg gagcacaccc tgaatcaccc ctactagcag cacgactgag 1020
ggcaggaggc tccatccttc tccccaaggt ctgtggctcc tcccgaaacc caagttatgc 1080
atccccatcc ccatgccctc atcccgacct cttagaaaat aaatacaagt ttgaacaggc 1140
gcaaaaa 1147



<210> 75
<211> 800
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7507083CB1

<400> 75

ggagaagcca tggctgatga gaaaccatcc aacggtgttc tggtccacat ggtgaagttg 60
ctgatcaaga cctttctaga tggcattttt gatgatttga tggaaaataa tgtgttaaat 120
acagatgaga tacaccttat aggaaaatgt ctaaagtttg tggtgagcaa tgctgaaaac 180
ctggttgatg atatcactga gacagctcaa attgcaggca aaatatttag ggaacacctg 240
tggaattcca aaaaacagct gagttcagat gtatccagtg atggagaaag agaggcgaac 300
atgcctggcc tcaacatccg caacaaagaa ttcaactatc ttcataatcg aaatggttct 360
gaacttgacc ttttggggat gcgagatcta cttgaaaacc ttggatactc agtggttata 420
aaagagaatc tcacagctca gatggtgctg ggattgtttg gttcaccact gacagtggaa 480
aagccagtgc agatactcat ggtcggctct tgcaaggtaa catctgtaat gatgctgtta 540
caaaggctca tgtggaaaag gacttcattg ctttcaaatc ttccacacca cataatgttt 600
cttggagaca tgaaacaaat ggctccgtct tcatttccca aattatctac tacttcagag 660
agtattcttg gagtcatcat ctagaggaaa tttttcaaaa ggttcaacat tcatttgaga 720
ccccaaatat actgacccag ctgcccacca ttgaaagact atccatgaca cgatatttct 780
atctctttcc tgggaattaa 800

<210> 76
<211> 1296
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7509097CB1

<400> 76

ctttacccta catgcctgag aagccagggc caagatggat cttctcctcg acatcagcta 60
agcctggagg actcttcccc tcagagacca tggagaggga cagccacggg aatgcatctc 120
cagcaagaac accttcagct ggagcatctc cagcccaggc atctccagct gggacacctc 180
caggccgggc atctccagcc caggcatctc cagcccaggc atctccagct gggacacctc 240
cgggccgggc atctccagcc caggcatctc cagctggtac acctccaggc cgggcatctc 300
caggccgggc atctccagcc caggcatctc cagcccaggc atctccagcc cgggcatctc 360
cggctctggc atcactttcc aggtcctcat ccggcaggtc atcatccgcc aggtcagcct 420
cggtgacaac ctccccaacc agagtgtacc ttgttagagc aacaccagtg ggggctgtac 480
ccatccgatc atctcctgcc aggtcagcac cagcaaccag ggccaccagg gagagcccag 540
gtacgagcct gcccaagttc acctggcggg agggccagaa gcagctaccg ctcatcgggt 600
gcgtgctcct cctcattgcc ctggtggttt cgctcatcat cctcttccag ttctggcagg 660
gccacacagg gatcaggtac aaggagcaga gggagagctg tcccaagcac gctgttcgct 720
gtgacggggt ggtggactgc aagctgaaga gtgacgagct gggctgcgtg aggtttgact 780
gggacaagtc tctgcttaaa atctactctg ggtcctccca tcagtggctt cccactgcgg 840
actgagggcc atgaccgggc ggatcgtggg aggggcgctg gcctcggata gcaagtggcc 900
ttggcaagtg agtctgcact tcggcaccac ccacatctgt ggaggcacgc tcattgacgc 960
ccagtgggtg ctcactgccg cccactgctt cttcgtgacc cgggagaagg tcctggaggg 1020
ctggaaggtg tacgcgggca ccagcaacct gcaccagttg cctgaggcag cctccattgc 1080
cgagatcatc atcaacagca attacaccga tgaggaggac gactatgaca tcgccctcgc 1140
acttgtcagc atctgtcaac tcatatccgg gccccaaagc ttctgcaggg cagaagtcaa 1200
agactcttaa agatccttac atggaacact tctgttttat aattagggaa actgaagccc 1260
aagggttata aataagtttg ctccaaatga cacatc 1296

<210> 77
<211> 1407
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7509118CB1

<400> 77

ctttacccta catgcctgag aagccaggac caagatggat cttctcctcg acatcagcta 60
agcctggagg actcttcccc tcagagacca tggagaggga cagccacggg aatgcatctc 120
cagcaagaac accttcagct ggagcatctc cagcccaggc atctccagct gggacacctc 180
caggccgggc atctccagcc caggcatctc cagcccaggc atctccagct gggacacctc 240
cgggccgggc atctccagcc caggcatctc cagctggtac acctccaggc cgggcatctc 300
caggccgggc atctccagcc caggcatctc cagcccaggc atctccagcc cgggcatctc 360
cggctctggc atcactttcc aggtcctcat ccggcaggtc atcatccgcc aggtcagcct 420
cggtgacaac ctccccaacc agagtgtacc ttgttagagc aacaccagtg ggggctgtac 480
ccatccgatc atctcctgcc aggtcagcac cagcaaccag ggccaccagg gagagcccag 540
gtacgagcct gcccaagttc acctggcggg agggccagaa gcagctaccg ctcatcgggt 600
gcgtgctcct cctcattgcc ctggtggttt cgctcatcat cctcttccag ttctggcagg 660
gccacacagg gatcaggtac aaggagcaga gggagagctg tcccaagcac gctgttcgct 720
gtgacggggt ggtggactgc aagctgaaga gtgacgagct gggctgcgtg aggtttgact 780
gggacaagtc tctgcttaaa atctactctg ggtcctccca tcagtggctt cccatctgta 840
gcagcaactg gaatgactcc tactcagaga agacctgcca gcagctgggt ttcgagagtg 900
ctcaccggac aaccgaggtt gcccacaggg attttgccaa cagcttctca atcttgagat 960
acaactccac catccaggaa agcctccaca ggtctgaatg cccttcccag cggtatatct 1020
ccctccagtg ttcccactgc ggactgaggg ccatgaccgg gcggatcgtg ggaggggcgc 1080
tggcctcgga tagcaagtgg ccttggcaag tgagtctgca cttcggcacc acccacatct 1140
gtggaggcac gctcattgac gcccagtggg tgctcactgc cgcccactgc ttcttcgtga 1200
cccgggagaa ggtcctggag ggctggaagg tgtacgcggg caccagcaac tcatatccgg 1260
gccccaaagc ttctgcaggg cagaagtcaa agactcttaa agatccttac atggaacact 1320
tctgttttat aattagggaa actgaagccc aagggttata aataagtttg ctccaaatga 1380
cacatctcac attacaaatt gatgacg 1407

<210> 78
<211> 1448
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7509312CB1

<400> 78

ctcctcgaca tcagctaagc ctggaggact cttcccctca gagaccatgg agagggacag 60 
ccacgggaat gcatctccag caagaacacc ttcagctgga gcatctccag cccaggcatc 120 
tccagctggg acacctccag gccgggcatc tccagcccag gcatctccag cccaggcatc 180 
tccagcccgg gcatctccgg ctctggcatc actttccagg tcctcatccg gcaggtcatc 240 
atccgccagg tcagcctcgg tgacaacctc cccaaccaga gtgtaccttg ttagagcaac 300
accagtgggg gctgtaccca tccgatcatc tcctgccagg tcagcaccag caaccagggc 360 
caccagggag agcccaggta cgagcctgcc caagttcacc tggcgggagg gccagaagca 420 
gctaccgctc atcgggtgcg tgctcctcct cattgccctg gtggtttcgc tcatcatcct 480 
cttccagttc tggcagggcc acacagggat caggtacaag gagcagaggg agagctgtcc 540 
caagcacgct gttcgctgtg acggggtggt ggactgcaag ctgaagagtg acgagctggg 600 
ctgcgtgagg tttgactggg acaagtctct gcttaaaatc tactctgggt cctcccatca 660
gtggcttccc atctgtagca gcaactggaa tgactcctac tcagagaaga cctgccagca 720 
gctgggtttc gagagtgctc accggacaac cgaggttgcc cacagggatt ttgccaacag 780 
cttctcaatc ttgagataca actccaccat ccaggaaagc ctccacaggt ctgaatgccc 840 
ttcccagcgg tatatctccc tccagtgttc ccactgcgga ctgagggcca tgaccgggcg 900 
gatcgtggga ggggcgctgg cctcggatag caagtggcct tggcaagtga gtctgcactt 960 
cggcaccacc cacatctgtg gaggcacgct cattgacgcc cagtgggtgc tcactgccgc 1020 
ccactgcttc ttcgtgaccc gggagaaggt cctggagggc tggaaggtgt acgcgggcac 1080 
cagcaacctg caccagttgc ctgaggcagc ctccattgcc gagatcatca tcaacagcaa 1140 
ttacaccgat gaggaggacg actatgacat cgccctcatg cggctgtcca agcccctgac 1200 
cctgtccggt gagggaatct gcactccccg ctctcctgcc ccccagcccc agcaccctct 1260 
gcagccctcg cacttgtcag catctgtcaa ctcatatccg ggccccaaag cttctgcagg 1320 
gcagaagtca aagactctta aagatcctta catggaacac ttctgtttta taattaggga 1380
aactgaagcc caagggttat aaataagttt gctccaaatg acacatctca cattacaaat 1440 
tgatgacg 1448

<210> 79 

<211> 2360 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 90126902CB1 

<400> 79 

tgccccggag ctcagtccca ccccctccgc cgatgaggcc atgaggcacc gaacggacct 60 
gggccaggtc accccctccc tgtcccatcc ccaatccggg ggactccctg ctgtctggga 120 
gctgagggag ccctggggcc ggggtgcagc tgagggcctc ttctagggac ccaactcctg 180 
atccagcccc tgccggccgc accttccgca gggtgtcaca gaatcatgga ctgagtccac 240 
ttggccccca ctgcacagat ggggaaactg aggcctggga gagtggagtg gctggcctca 300
ggccacacgg aaaggcccca cctcttccag aacctcctgc tcttcctgtg ggccctgctg 360 
aactgtggtt tgggggtcag tgctcagggt ccgggcgagt ggaccccgtg ggtgtcctgg 420 
acccgctgct ccagctcctg cgggcgtggc gtctccgtgc gcagccggcg ctgcctccgg 480 
cttcctgggg aagaaccgtg ctggggagac tcccatgagt accgcctctg ccagttgcca 540 
gactgccccc caggggctgt gcccttccga gacctacagt gtgccctgta caatggccgc 600 
cctgtcctgg gcacccagaa gacctaccag tgggtgccct tccatggggc gcccaaccag 660 
tgcgacctca actgcctggc tgaggggcac gccttctacc acagcttcgg ccgcgtcctg 720 
gacggcaccg cctgcagccc gggtgcccag ggggtctgcg tggctggccg ctgccttagc 780 
gccggctgtg atgggttgtt gggctcgggt gccctcgagg accgctgtgg ccgctgcgga 840 
ggcgccaacg actcgtgcct tttcgtgcag cgcgtgtttc gtgacgccgg tgccttcgct 900 
gggtactgga acgtgaccct gatccccgag ggcgccagac acatccgcgt ggaacacagg 960 
agccgcaacc acctgggtat cctaggatca ctgatggggg gcgatgggcg ctacgtgctt 1020
aatgggcact gggtggtcag cccaccaggg acctacgagg cggccggcac gcatgtggtc 1080
tacacccgag acacagggcc ccaggagaca ttgcaagcag ccgggcccac ctcccatgac 1140
ctgctcctac aggtcctcct gcaggagccc aaccctggca tcgagtttga gttctggctc 1200
cctcgggagc gctacagccc cttccaggct cgtgtgcagg ccctgggctg gcccctgagg 1260
cagcctcagc cccggggggt ggagcctcag ccccccgcag cccctgctgt cacccctgca 1320
cagaccccaa cgctggcccc agacccctgc ccaccctgcc ctgacacccg cggccgcgcc 1380
caccgactac tccactattg cggcagtgac tttgtgttcc aggcccgagt gctgggccac 1440
caccaccagg cccaggagac ccgctatgag gtgcgcatcc agctcgtcta caagaaccgc 1500
tcgccactgc gggcacgcga gtacgtgtgg gcgccaggcc actgcccctg cccgatgctg 1560
gcaccccacc gggactacct gatggctgtc cagcgtcttg tcagccccga cggcacacag 1620
gaccagctgc tgctgcccca cgccggctac gcccggccct ggagccctgc gcraggacagc 1680
cgcatacgcc tgactgcccg gcgctgtcct ggctgagccc ctgcaggagc cccggccaca 1740
cacagcaaga aagatacatc tgaccagcct caacgtcaac gtatttcccc tctcaccctg 1800
gcttccaggc agctctgaaa tacgtcccac ctgtgcagct atgtgactcc ctcccacaca 1860
cgcttaagac acctctgcat gcagtcaaag ccactgtcac aagccggcag gcactggtga 1920
sgaggcacta aggagactct gacttttatt tcgcctctct ccttggctgc caggaagctc 1980
atagctattt atactcagaa agtttaacgc tgctttcttt ctctttgcgc gcgtcacact 2040
tgcttggaga cactgtcatg aacgagcatg acaccctgct gccctgggta cccagaagat 2100
catctgttta cttcccagac actgtgctgt ctctgctctc tgctactcac acacaccctc 2160
atgtgtgaag ggcagagaca ctgtcacaaa caggcatgcc ccttagaaga catgcctaac 2220
caggcactgt aacgtaccaa cgtaccaatt tccccttttc ccctggctac caggaaactc 2280
ggagacaatc ttttcagcct cagcatttct ggctggattt ccacccatca acacgtgctt 2340
gctcctcctt tttttttttt 2360



<210> 80
<211> 1109
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7509352CB1

<400> 80

atggcgtggt ccccaccagc caccctcttt ctgttcctgc tgctgctagg ccagccccct 60
cccagcaggc cacagtcact gggcaccact aagctccggc tggtgggccc agagagcaag 120
ccagaggagg gccgcctgga ggtgctgcac cagggccagt ggggcaccgt gtgtgatgac 180
aactttgcta tccaggaggc cacagtggct tgccgccagc tgggcttcga agctgccttg 240
acctgggccc acagtgccaa gtacggccaa ggggagggac ccatctggct ggacaatgtg 300
cgctgtgtgg gcacagagag ctccttggac cagtgccaca ggcattacca cagcattgag 360
gtcttcaccc actacgacct cctcactctc aatggctcca aggtggctga ggggcacaag 420
gccagcttct gtctggagga cacaaactgc cccacaggac tgcagcggcg ctacgcatgt 480
gccaactttg gagaacaggg agtgactgta ggctgctggg acacctaccg gcatgacatt 540
gattgccagt gggtggatat cacagatgtg ggccccggga attatatctt ccaggtgatt 600
gtgaaccccc actatgaagt ggcagagtca gatttctcca acaatatgct gcagtgccgc 660
tgcaagtatg atgggcaccg ggtctggctg cacaactgcc acacagggaa ttcataccca 720
gccaatgcag aactctccct ggagcaggaa cagcgtctca ggaacaacct catctgaagc 780
tgtcactgca cactcctagc tgctgccgat acaccagata cctcagctta ttggagccat 840
gcccttcaca gagtcccaac tcagaggaaa agggccagtg ccaaggggca ccaagaacct 900
gctcaggaag ccttttgatg gcaagatcac caatccagat ggtattgctc cctcaggatg 960
gctctgggcc tgcccctaag ggcctgtggc ctatggaata tgtcctccag gctttgctca 1020
gctgagctcc tcttctgtaa ggaaacccag tcatccctga atcttgccac agagatccgg 1080
gattcaggag ctctcagttt cttaaggag 1109



<210> 81 

<211> 905 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7509341CB1 



<400> 81 

caacaccaag cgcatggact ctgaccccag tgcagtgtct gtagggaaga ggagccatgg 60 
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ggcttcgggc aggccccatc ctgcttctgc tgctgtggct gctgccaggg gcccattggg 120 
atgtgctgcc ttcagaatgc ggccactcca aggaggccgg gaggattgtg ggaggccaag 180 
acacccagga aggacgctgg ccgtggcagg ttggcctttg gttgacctca gtggggcatg 240 
tatgtggggg ctccctcatc cacccacgct gggtgctcac agccgcccac tgcttcctga 300 
ggtctgagga tcccgggctc taccatgtta aagtcggagg gctgacaccc tcactttcag 360 
agccccactc ggccttggtg gctgtgagga ggctcctggt ccactcctca taccatggga 420 
ccaccaccag cggggacatt gccctgatgg agctggactc ccccttgcag gcctcccagt 480 
tcagccccat ctgcctccca ggaccccaga cccccctcgc cattgggacc gtgtgctggg 540 
gtgactccgg ggggccgctg gtctgcccca tcaatgatac gtggatccag gccggcattg 600
tgagctgggg attcggctgt gcccggcctt tccggcctgg tgtctacacc caggtgctaa 660 
gctacacaga ctggattcag agaaccctgg ctgaatctca ctcaggcatg tctggggccc 720 
gcccaggtgc cccaggatcc cactcaggca cctccagatc ccacccagtg ctgctgcttg 780 
agctgttgac cgtatgcttg cttgggtccc tgtgaaccat gagccatgga gtccgggatc 840 
ccctttctgg taggattgat ggaatctaat aataaaaact gtaggttttt tatgtgtatt 900 
ttaaa 905 



<210> 82
<211> 626
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7509367CB1

<400> 82

caacaccaag cgcatggact ctgaccccag tgcagtgtct gtagggaaga ggagccatgg 60
ggcttcgggc aggccccatc ctgcttctgc tgctgtggct gctgccaggg gcccattggg 120
atgtgctgcc ttcagaatgc ggccactcca aggaggccgg gaggattgtg ggaggccaag 180
acacccagga aggacgctgg ccgtggcagg ttggcctgtg gttgacctca gtggggcatg 240
tatgtggggg ctccctcatc cacccacgct gggtgctcac agccgcccac tgcttcctga 300
gggtgactcc ggggggccgc tggtctgccc catcaatgat acgtggatcc aggccggcat 360
tgtgagctgg ggattcggct gtgcccggcc tttccggcct ggtgtctaca cccaggtgct 420
aagctacaca gactggattc agagaaccct ggctgaatct cactcaggca tgtctggggc 480
ccgcccaggt gccccaggat cccactcagg cacctccaga tcccacccag tgctgctgct 540
tgagctgttg accgtatgct tgcttgggtc cctgtgaacc atgagccatg gagtccggga 600
tcccctttct ggtaggattg atggaa 626



<210> 83 

<211> 4095 

<212> DNA

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7500455CB1 



<400> 83 

cccacgcgtc cgggggtgag ctgggctggc ttccgtcctg gtagccaagg ctaatcctcc 60 
ctcgagttct tgggagatgg gcatttggcg agaaggctgg cgttagtgaa gcgcgcccgg 120 
cgtcacggaa agaagcactt gtaaggaaat atagcatcca ttgtgaaagt ggaaaagtaa 180 
agataattca tcatgcctgc tgtggcttca gttcctaaag aactctacct cagttcttca 240 
ctaaaagacc ttaataagaa gacagaagtt aaaccagaga aaataagcac taagaggatt 300 
atttccattc aatacttgga cctggaaaca tcaaaaaagc tgtcgaagaa gctgaaagac 360 
tctctgaaag ccttaaatta agatatgaag aagctgaagt ccggaaaaaa cttgaggaaa 420 
aagacaggca ggaggaagca cagcggctac aacaaaaaag gcaggaaaca ggaagagagg 480 
atggtggcac attggctaaa ggctctttgg agaatgtttt ggattccaaa gacaaaaccc 540 
aaaagagcaa tggtgaaaag aatgaaaaat gtgagaccaa agagaaagga gcaatcacag 600 
caaaggaact atacacaatg atgacggata aaaacatcag cttgattata atggatgctc 660 
gaagaatgca ggattatcag gattcctgta ttttacattc tctcagtgtt cctgaagaag 720 
ccatcagtcc aggagtcact gctagttgga ttgaagcaca cctgccagat gattctaaag 780 
acacatggaa gacgaggggg aatgtggagt atgtggtact tcttgactgg tttagttctg 840 
ccaaagattt acagattgga acaactctcc ggagtctgaa agatgcactt ttcaagtggg 900 
aaagtaaaac tgtcctgcgc aatgagcctt tggttttaga gggaggctat gaaaactggc 960
tcctttgtta tccccagtat acaacaaatg ctaaggtcac tccaccccca cgacgccaga 1020
atgaagaggt gtctatctca ttggatttta cttatccctc attggaagaa tcaattcctt 1080
ctaaacctgc tgcccagacg ccacctgcat ctatagaagt agatgaaaat atagaattga 1140
taagtggtca aaatgagaga atgggaccac tgaatatatc aactccagtt gaaccagttg 1200
ctgcttctaa atctgatgtt tcacccataa ttcagccagt gcctagtata aagaatgttc 1260
cacagattga tcgtactaaa aaaccagcag tcaaattgcc tgaagagcat agaataaaat 1320
ctgaaagtac aaaccatgag caacaatctc ctcagagtgg aaaagttatt cctgatcgtt 1380
ccaccaagcc agtagttttt tctccaactc tcatgttaac agatgaagaa aaggctcgta 1440
ttcatgcaga aactgctctt ctaatggaaa aaaacaaaca agaaaaagaa cttcgggaaa 1500
ggcagcaaga ggaacagaaa gagaaactga ggaaggaaga acaagaacaa aaagccaaaa 1560
agaaacaaga agctgaagaa aatgaaatta cagagaagca acaaaaagca aaagaagaaa 1620
tggagaagaa agaaagtgaa caggccaaga aagaagataa agaaacctca gcaaagaggg 1680
gcaaagaaat aacaggagta aaaagacaaa gtaaaagtga acatgaaact tctgatgcca 1740
agaaatctgt agaagatagg gggaaaaggt gtccaacccc agaaatacag aaaaagtcaa 1800
caggagatgt gccccataca tctgtgacag gggattcagg ttcaggcaag ccatttaaga 1860
ttaaaggaca accagaaagt ggaattctaa ggacaggaac ttttagagag gatacagacg 1920
ataccgaaag aaataaagct caacgagaac ctttgacaag agcacgaagt gaagaaatgg 1980
ggaggatcgt aacaggactg ccttcaggct gggccaagtt tcttgaccca atcactggaa 2040
cctttcgtta ttatcattca cccaccaaca ctgttcatat gtacccaccg gaaatggctc 2100
cttcatctgc acctccttcc acccctccaa ctcataaagc caagccacag attcctgctg 2160
agcgggatag ggaaccttcc aaactgaagc gctcctactc ctccccagat ataacccagg 2220
ctattcaaga ggaagagaag aggaagccaa cagtaactcc aacagttaat cgggaaaaca 2280
agccaacatg ttatcctaaa gctgagatct caaggctttc tgcttctcag attcggaacc 2340
tcaatcctgt ttttggaggt tctggaccag ctcttactgg acttcgtaac ttaggaaata 2400
cttgttatat gaactcaata ttgcagtgcc tatgtaacgc tccacatttg gctgattatt 2460
tcaaccgaaa ctgttatcag gatgatatta acaggtcaaa tttgttgggg cataaaggtg 2520
aagtggcaga agaatttggt ataatcatga aagccctgtg gacaggacag tatagatata 2580
tcagtccaaa ggactttaaa atcaccattg ggaagatcaa tgaccagttt gcaggataca 2640
gtcagcaaga ttcacaagaa ttgcttctgt tcctaatgga tggtctccat gaagatctaa 2700
ataaagctga taatcggaag agatataaag aagaaaataa tgatcatctc gatgacttta 2760
aagctgcaga acatgcctgg cagaaacaca agcagctcaa tgagtctatt attgttgcac 2820
tttttcaggg tcaattcaaa tctacagtac agtgcctcac atgtcacaaa aagtctagga 2880
catttgaggc cttcatgtat ttgtctctac cactagcatc cacaagtaaa tgtacattac 2940
aggattgcct tagattattt tccaaagaag aaaaactcac agataacaac agattttact 3000
gcagtcattg cagagctcga cgggattctc taaaaaagat agaaatctgg aagttaccac 3060
ctgtgctttt agtgcatctg aaacgttttt cctacgatgg caggtggaaa caaaaattac 3120
agacatctgt ggacttcccg ttagaaaatc ttgacttgtc acagtatgtt attggtccaa 3180
agaacaattt gaagaaatat aatttgtttt ctgtttcaaa tcactacggt gggctggatg 3240
gaggccacta cacagcctat tgtaaaaatg cagcaagaca acggtggttt aagtttgatg 3300
atcatgaagt ttctgatatc tccgtttctt ctgtgaaatc ttcagcagct tatatcGtct 3360
tttatacttc attgggacca cgagtaactg atgtagccac ataaggagac ataggttata 3420
aactagttat cttttaaaag gctcagcaac acaactcttg aaatgcttat caggataatg 3480
gtagctatag ctggccattt agaggaattc taggacagtg ggagctgtgt tactagcact 3540
atataattcc ggtcagtgct gacaaataac atttaacaag tattgcagta atcatcactt 3600
acaggtacca tttatttcaa aacaactttt ttagtctgct ccaaagttaa aataattaac 3660
tagctaagca ttattattcg actggtctaa aaactattgt tatctttttt ttttcctttt 3720
cactgttatg gccttttcac atttctaaat cccatcttga tatactatga atactctaga 3780
atgatgtaaa gcagatagga atgtatgtgt acatatttat tgcatacttg cacatcaaat 3840
cgatgtacat agtttaacac gtggtccttt tgtgaaacct agaactcaga ggattgcttt 3900
ttttctttca gcctattttg agttaacttc agtgctttct tagggaaatg acagggcaaa 3960
gcaatttttc tgttggcttt gggctgtatt tgtgcactaa atctttattc taaaaaaaaa 4020
aatggaaact ttaatttttt taaaacgaga atttcattta cagctacatt aaaatcttaa 4080
tgagaaaaat aatta 4095



<210> 84 

<211> 1308 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7510401CB1 



85/112 



WO 03/063688



PCT/US03/02500 



<400> 84 

ggcacttccg cttcccctcg gctttcttct cgtcggtgtt cccggctgct atagagccgg 60 
gtgagagagc gagcgcccgt cggcgggtgt cgagggcggg ttgcctcgcg ctgacccttc 120 
ccgccctcct tctcgtcaca caccaggtcc ccgcggaagc cgcggtgtcg gcgccatggc 180 
ggagctgacg gctcttgaga gtctcatcga gatgggcttc cccaggggac gcgcggagaa 240 
ggctctggcc ctcacaggga accagggcat cgaggctgcg atggactggc tgatggagca 300 
cgaagacgac cccgatgtgg acgagccttt agagactccc cttggacata tcctgggacg 360 
ggagcccact tcctcagagc aaggcggcct tgaaggatct ggttctgctg ccggagaagg 420 
caaacccgct ttgagtgaag aggaaagaca ggaacaaact aagaggatgt tggagctggt 480 
ggcccagaag cagcgggaga gtgaagaaag agaggaacgg gaggcattgg aacgggaacg 540 
gcagcgcagg agacaagggc aagagttgtc agcagcacga cagcggctac aggaagatga 600 
gatgcgccgg gctgctgagg agaggcggag ggaaaaggcc gaggagttag cagccagaca 660 
aagagttaga gaaaagatcg agagggacaa agcagagaga gccaagaagg taggtgactg 720 
agaagacgcc acgggtaatg gagtgggcag gttagtaaat cttttccagc ttcagaggga 780 
aagcatgggc ttatatgatc tgtttcacct gaggatgggt gagtacctga ttgactgttt 840 
tactttgccc tttagtatgg tggcagtgtg ggctctcagc cacccccagt ggcaccagag 900 
ccaggtcctg ttccctcttc tcccagccag gagcctccca ccaagcggga gtatgaccag 960 
tgtcgcatac aggtcaggct gccagatggg acctcactga cccagacgtt ccgggcccgg 1020 
gaacagctgg cagctgtgag gctctatgtg gagctccacc gtggggagga actaggtggg 1080 
ggccaggacc ctgtgcaatt gctcagtggc ttccccagac gggccttctc agaagctgac 1140 
atggagcggc ctctgcagga gctgggactc gtgccttctg ctgttctcat tgtggccaag 1200 
aaatgtccca gctgagggcc tttgtcccat tgtccctctg tgaccccttc atctttgata 1260 
aagcactgac atctccttcc taataaatag accctgagtt ctgtaaaa 1308 

<210> 85 
<211> 1196 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7504702CB1 

<400> 85 

cagcgatgtc tccaccaccg ctgctgcaac ccctgctgct gctgctgcct ctgctgaatg 60 
tggagccttc cggggccaca ctgatccgca tccctcttca tcgagtccaa cctggacgca 120 
ggatcctgaa cctactgagg ggatggagag aaccagcaga gctccccaag ttgggggccc 180 
catcccctgg ggacaagccc atcttcgtac ctctctcgaa ctacagggat ggttacacca 240 
ccgatttgat cccaaagcct ctagctcctt ccaggccaat gggaccaagt ttgccattca 300 
atatggaact gggcgggtag atggaatcct gagcgaggac aagctgacta ttggtggaat 360 
caagggtgca tcagtgattt tcggggaggc tctctgggag cccagcctgg tcttcgcttt 420 
tgcccatttt gatgggatat tgggcctcgg ttttcccatt ctgtctgtgg aaggagttcg 480 
gcccccgatg gatgtactgg tggagcaggg gctattggat aagcctgtct tctcctttta 540 
cctcaacagg gaccctgaag agcctgatgg aggagagctg gtcctggggg gctcggaccc 600 
ggcacactac atcccacccc tcaccttcgt gccagtcacg gtccctgcct actggcagat 660 
ccacatggag cgtgtgaagg tgggcccagg gctgactctc tgtgccaagg gctgtgctgc 720 
catcctggat acgggcacgt ccctcatcac aggacccact gaggagatcc gggccctgca 780 
tgcagccatt gggggaatcc ccttgctggc tggggagtac atcatcctgt gctcggaaat 840 
cccaaagctc cccgcagtct ccttccttct tgggggggtc tggtttaacc tcacggccca 900 
tgattacgtc atccagacta ctcgaaatga cgtcttcttg gggacgtatg tggccgtctt 960
cgaccgcggg gacatgaaga gcagcgcccg ggtgggcctg gcgcgcgctc gcactcgcgg 1020 
agcggacctc ggatggggag agactgcgca ggcgcagttc cccgggtgac gcccaagtga 1080 
agcgcatgcg cagcgggtgg tcgcggaggt cctgctaccc agtaaaaatc cactatttcc 1140 
attgaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 1196 

<210> 86 
<211> 1419 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7509113CB1






<400> 86 

ctttgctcca gcagcctgtc ccagtgagga cagggacaca gtactcggcc acaccatggg 60
gcgcccactg cacctcgtcc tgctcagtgc ctccctggct ggcctcctgc tgctcgggga 120 
aagtctgttc atccgcaggg agcaggccaa caacatcctg gcgagggtca cgagggccaa 180 
ttcctttctt gaagagatga agaaaggaca cctcgaaaga gagtgcatgg aagagacctg 240 
ctcatacgaa gaggcccgcg aggtctttga ggacagcgac aagacgaatg aattctggaa 300 
taaatacaaa gatggcgacc agtgtgagac cagtccttgc cagaaccagg gcaaatgtaa 360
agacggcctc ggggaataca cctgcacctg tttagaagga ttcgaaggca aaaactgtga 420 
attatggccc tacccctgtg ggaaacagac cctggaacgc aggaagaggt cagtggccca 480 
ggccaccagc agcagcgggg aggcccctga cagcatcaca tggaagccat atgatgcagc 540 
cgacctggac cccaccgaga accccttcga cctgcttgac ttcaaccaga cgcagcctga 600 
gaggggcgac aacaacctca ccaggatcgt gggaggccag gaatgcaagg acggggagtg 660
tccctggcag gccctgctca tcaatgagga aaacgagggt ttctgtggtg gaactattct 720 
gagcgagttc tacatcctaa cggcagccca ctgtctctac caagccaaga gattcaaggt 780
gagggtaggg gaccggaaca cggagcagga ggagggcggt gaggcggtgc acgaggtgga 840 
ggtggtcatc aagcacaacc ggttcacaaa ggagacctat gacttcgaca tcgccgtgct 900 
ccggctcaag acccccatca ccttccgcat gaacgtggcg cctgcctgcc tccccgagcg 960 
tgactgggcc gagtccacgc tgatgacgca gaagacgggg attgtgagcg gcttcgggcg 1020 
cacccacgag aagggccggc agtccaccag gctcaagatg ctggaggtgc cctacgtgga 1080 
ccgcaacagc tgcaagctgt ccagcagctt catcatcacc cagaacatgt tctgtgccgg 1140 
ctacgacacc aagcaggagg atgcctgcca gggggacagc gggggcccgc acgtcacccg 1200 
cttcaaggac acctacttcg tgacaggcat cgtcagctgg ggagagggct gtgcccgtaa 1260 
ggggaagtac gggatctaca ccaaggtcac cgccttcctc aagtggatcg acaggtccat 1320
gaaaaccagg ggcttgccca aggccaagag ccatgccccg gaggtcataa cgtcctctcc 1380 
attaaagtga gatcccactc aaaaaaaaaa aaaaggggg 1419 

<210> 87 
<211> 1200 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7509140CB1 

<400> 87 

gcagcctgtc ccagtgagga cagggacaca gtactcggcc acaccatggg gcgcccactg 60 
cacctcgtcc tgctcagtgc ctccctggct ggcctcctgc tgctcgggga aagtctgttc 120 
atccgcaggg agcaggccaa caacatcctg gcgagggtca cgagggccaa ttcctttctt 180 
gaagagatga agaaaggaca cctcgaaaga gagtgcatgg aagagacctg ctcatacgaa 240 
gaggcccgcg aggtctttga ggacagcgac aagacgaatg aattctggaa taaatacaaa 300 
gatggcgacc agtgtgagac cagtccttgc cagaaccagg gcaaatgtaa agacggcctc 360 
ggggaataca cctgcacctg tttagaagga ttcgaaggca aaaactgtga attattcaca 420 
cggaagctct gcagcctgga caacggggac tgtgaccagt tctgccacga ggaacagaac 480
tctgtggtgt gctcctgcgc ccgcgggtac accctggctg acaacggcaa ggcctgcatt 540 
cccacagggc cctacccctg tgggaaacag accctggaac gcaggaagag gtcagtggcc 600 
caggccacca gcagcagcgg ggaggcccct gacagcatca catggaagcc atatgatgca 660 
gccgacctgg accccaccga gaaccccttc gacctgcttg acttcaacca gacgcagcct 720 
gagaggggcg acaacaacct caccaggatc gtgggaggcc aggaatgcaa ggacggggag 780 
tgtccctggc aggccctgct catcaatgag gaaaacgaga agggccggca gtccaccagg 840 
ctcaagatgc tggaggtgcc ctacgtggac cgcaacagct gcaagctgtc cagcagcttc 900 
atcatcaccc agaacatgtt ctgtgccggc tacgacacca agcaggagga tgcctgccag 960 
ggggacagcg ggggcccgca cgtcacccgc ttcaaggaca cctacttcgt gacaggcatc 1020 
gtcagctggg gagagggctg tgcccgtaag gggaagtacg ggatctacac caaggtcacc 1080 
gccttcctca agtggatcga caggtccatg aaaaccaggg gcttgcccaa ggccaagagc 1140 
catgccccgg aggtcataac gtcctctcca ttaaagtgag atcccactca aaaaaaaaaa 1200 

<210> 88 
<211> 1982 
<212> DNA 

<213> Homo sapiens 
<220> 






<221> misc_feature

<223> Incyte ID No: 7509223CB1 

<220> 

<221> unsure 

<222> (1)..(1982)

<223> a, t, c, g, or other 

<400> 88 

gttttgggta tcgtaccgtt tgattaccac ccttggagcg catttgaatt atgccatgtg 60 
cggaaacgcc tgcagtaccg tccggatccc ggtcgacaga cattccagag aatcctgttt 120 
tagaccacct tgcgctgaga gcacacccgt gggtcaaagg cagccacccg ggggtcaaag 180 
ggcaccatca ggtactcccc agggaaggct ttgcgccacc agtcacctgc aaccccgcct 240 
cacctccgat gcctgctgtg cccagggtgt cccgctcata gcgacggcct gtgcgttcat 300
acgacctcga gcgctctcgt cgctgatctc tgtagtaatg acttttcttc ttgtattttc 360 
ccggattatc cgcccgctcc cgtgacctgc ttcgagaacg ggaagaactc cggctccgag 420 
acttgtgtgg cttctggggg tacttgcagt ccttggactt ccgggagccc cgaatctcag 480 
agcctttgta gggagcgctg cggggggcct gtctcggtgg ggagtcactc ctgctccggg 540 
agcggctctg cgactgannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600
nnngggacac agtactcggc cacaccatgg ggcgcccact gcacctcgtc ctgctcagtg 660 
cctccctggc tggcctcctg ctgctcgggg aaagtctgtt catccgcagg gagcaggcca 720 
acaacatcct ggcgagggtc acgagggcca attcctttct tgaagagatg aagaaaggac 780 
acctcgaaag agagtgcatg gaagagacct gctcatacga agaggcccgc gaggtctttg 840 
aggacagcga caagacgaat gaattctgga ataaatacaa agatggcgac cagtgtgaga 900 
ccagtccttg ccagaaccag ggcaaatgta aagacggcct cggggaatac acctgcacct 960 
gtttagaagg attcgaaggc aaaaactgtg aattattcac acggaagctc tgcagcctgg 1020 
acaacgggga ctgtgaccag ttctgccacg aggaacagaa ctctgtggtg tgctcctgcg 1080 
cccgcgggta caccctggct gacaacggca aggcctgcat tcccacaggg ccctacccct 1140 
gtgggaaaca gaccctggaa cgcaggaaga ggtcagtggc ccaggccacc agcagcagcg 1200 
gggaggcccc tgacagcatc acatggaagc catatgatgc agccgacctg gaccccaccg 1260 
agaacccctt cgacctgctt gacttcaacc agacgcagcc tgagaggggc gacaacaacc 1320 
tcaccaggat cgtgggaggc caggaatgca aggacgggga gtgtccctgg caggccctgc 1380 
tcatcaatga ggaaaacgag gtggaggtgg tcatcaagca caaccggttc acaaaggaga 1440 
cctatgactt cgacatcgcc gtgctccggc tcaagacccc catcaccttc cgcatgaacg 1500 
tggcgcctgc ctgcctcccc gagcgtgact gggccgagtc cacgctgatg acgcagaaga 1560 
cggggattgt gagcggcttc gggcgcaccc acgagaaggg ccggcagtcc accaggctca 1620 
agatgctgga ggtgccctac gtggaccgca acagctgcaa gctgtccagc agcttcatca 1680 
tcacccagaa catgttctgt gccggctacg acaccaagca ggaggatgcc tgccaggggg 1740 
acagcggggg cccgcacgtc acccgcttca aggacaccta cttcgtgaca ggcatcgtca 1800 
gctggggaga gggctgtgcc cgtaagggga agtacgggat ctacaccaag gtcaccgcct 1860 
tcctcaagtg gatcgacagg tccatgaaaa ccaggggctt gcccaaggcc aagagccatg 1920 
ccccggaggt cataacgtcc tctccattaa agtgagatcc cactcaaaaa aaaaaaaaag 1980 
gg 1982 

<210> 89 
<211> 1282 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7509272CB1 



<400> 89 

gccagggacc aagatggatc ttctcctcga catcagctaa gcctggagga ctcttcccct 60
cagagaccat ggagagggac agccacggga atgcatctcc agcaagaaca ccttcagctg 120
gagcatctcc agcccaggca tctccagctg ggacacctcc aggccgggca tctccagccc 180
aggcatctcc agcccaggca tctccagctg ggacacctcc gggccgggca tctccagccc 240
aggcatctcc agctggtaca cctccaggcc gggcatctcc aggccgggca tctccagccc 300
aggcatctcc agcccaggca tctccagccc gggcatctcc ggctctggca tcactttcca 360
ggtcagcacc agcaaccagg gccaccaggg agagcccagg tacgagcctg cccaagttca 420
cctggcggga gggccagaag cagctaccgc tcatcgggtg cgtgctcctc ctcattgccc 480
tggtggtttc gctcatcatc ctcttccagt tctggcaggg ccacacaggg atcaggtaca 540
aggagcagag ggagagctgt cccaagcacg ctgttcgctg tgacggggtg gtggactgca 600
agctgaagag tgacgagctg ggctgcgtga ggtttgactg ggacaagtct ctgcttaaaa 660
tctactctgg gtcctcccat cagtggcttc ccatctgtag cagcaactgg aatgactcct 720
actcagagaa gacctgccag cagctgggtt tcgagagtgc tcaccggaca accgaggttg 780
cccacaggga ttttgccaac agcttctcaa tcttgagata caactccacc atccaggaaa 840
gcctccacag gtctgaatgc ccttcccagc ggtatatctc cctccagtgt tcccactgcg 900
gactgagggc catgaccggg cggatcgtgg gaggggcgct ggcctcggat agcaagtggc 960
cttggcaagt gagtctgcac ttcggcacca cccacatctg tggaggcacg ctcattgacg 1020
cccagtgggt gctcactgcc gcccactgct tcttcgtgac ccgggagaag gtcctggagg 1080
gctggaaggt gtacgcgggc accagcaacc tgcaccagtt gcctgaggca gcctccattg 1140
ccgagatcat catcaacagc aattacaccg atgaggagga agtcaaagac tcttaaagat 1200
ccttacatgg aacacttctg ttttataatt agggaaactg aagcccaagg gttataaata 1260
agtttgctcc aaatgacaca tc 1282



<210> 90 

<211> 1337 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7509327CB1



<400> 90 

gccagggacc aagatggatc ttctcctcga catcagctaa gcctggagga ctcttcccct 60
cagagaccat ggagagggac agccacggga atgcatctcc agcaagaaca ccttcagctg 120
gagcatctcc agcccaggca tctccagctg ggacacctcc aggccgggca tctccagccc 180
aggcatctcc agcccaggca tctccagctg ggacacctcc gggccgggca tctccagccc 240
aggcatctcc agctggtaca cctccaggcc gggcatctcc aggccgggca tctccagccc 300
aggcatctcc agcccaggca tctccagccc gggcatctcc ggctctggca tcactttcca 360
ggtcctcatc cggcaggtca tcatccgcca ggtcagcctc ggtgacaacc tccccaacca 420
gagtgtacct tgttagagca acaccagtgg gggctgtacc catccgatca tctcctgcca 480
ggtcagcacc agcaaccagg gccaccaggg agagcccagg tacgagcctg cccaagttca 540
cctggcggga gggccagaag cagctaccgc tcatcgggtg cgtgctcctc ctcattgccc 600
tggtggtttc gctcatcatc ctcttgctca ccggacaacc gaggttgccc acagggattt 660
tgccaacagc ttctcaatct tgagatacaa ctccaccatc caggaaagcc tccacaggtc 720
tgaatgccct tcccagcggt atatctccct ccagtgttcc cactgcggac tgagggccat 780
gaccgggcgg atcgtgggag gggcgctggc ctcggatagc aagtggcctt ggcaagtgag 840
tctgcacttc ggcaccaccc acatctgtgg aggcacgctc attgacgccc agtgggtgct 900
cactgccgcc cactgcttct tcgtgacccg ggagaaggtc ctggagggct ggaaggtgta 960
cgcgggcacc agcaacctgc accagttgcc tgaggcagcc tccattgccg agatcatcat 1020
caacagcaat tacaccgatg aggaggacga ctatgacatc gccctcatgc ggctgtccaa 1080
gcccctgacc ctgtccggtg agggaatctg cactccccgc tctcctgccc cccagcccca 1140
gcaccctctg cagccctcgc acttgtcagc atctgtcaac tcatatccgg gccccaaagc 1200
ttctgcaggg cagaagtcaa agactcttaa agatccttac atggaacact tctgttttat 1260
aattagggaa actgaagccc aagggttata aataagtttg ctccaaatga cacatctcac 1320
attacaaatt gatgacg 1337



<210> 91 
<211> 843 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7504677CB1 



<400> 91 

cgcaggacaa cgcctgccac gcgtcggaac tcggccgcgg gacatccacg gggcgcgagt 60
gacacgcggg agggagagca gtgttctgct ggagccgatg ccaaaaacca tgcatttctt 120
attcagattc attgttttct tttatctgtg gggccttttt actgctcaga gacaaaagaa 180
agaggagagc accgaagaag tgaaaataga agttttgcat cgtccagaaa actgctctaa 240
gacaagcaag aagggagacc tactaaatgc ccattatgac ggctacctgg ctaaagacgg 300
ctcgaaattc tactgcagca gaaggcaaga ttccaccgga tgctacattg atttttgaga 360
ttgaacttta tgctgtgacc aaaggaccac ggagcattga gacatttaaa caaatagaca 420
tggacaatga caggcagctc tctaaagccg agataaacct ctacttgcaa agggaatttg 480
aaaaagatga gaagccacgt gacaagtcat atcaggatgc agttttagaa gatattttta 540
agaagaatga ccatgatggt gatggcttca tttctcccaa ggaatacaat gtataccaac 600
acgatgaact atagcatatt tgtatttcta cttttttttt tagctattta ctgtacttta 660
tgtataaaac aaagtcactt ttctccaagt tgtatttgct atttttcccc tatgagaaga 720
tattttgatc tccccaatac attgattttg gtataataaa tgtgaggctg ttttgcaaac 780
ttaacttgca ggaatggtat gactgtgttt cctactgctt tattctgtaa acaagaattg 840
tag 843



<210> 92 

<211> 3360 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7504534CB1



<400> 92 

ggggcggggc cgcggaggag atggcggaag gtggaggctg ccgtgagcga ccggatgcgg 60
agactcagaa gtccgagctt ggacccttaa tgaggaccac actccaacgc ggggcgcagt 120
ggtatcttat tgacagccgg tggttcaagc agtggaagaa gtatgtgggc tttgacagct 180
ggggcatgta caatgtgggt gaacataacc tatttcctgg cccaatagac aactctgggc 240
tattttcaga tcctgagagt cagaccttga aagaacactt aattgatgaa ttggactatg 300
tattggtccc taccgaggcg tggaataaac tactaaactg gtacggctgt gtagaaggcc 360
agcaacccat cgtcagaaaa gttgtggagc atggcctgtt tgtcaagcac tgcaaagtcg 420
aggtgtattt gctggaactg aagctctgtg agaacagtga ccccaccaat gtgctgagtt 480
gccatttcag caaggcagac accattgcaa ccatcgagaa agagatgcgg aagctattca 540
acatccctgc ggagcgtgaa acacggctct ggaacaaata catgagcaac acctacgagc 600
agttgagcaa gctagacaac actgtccagg atgctgggct ataccagggt caggtgctag 660
taattgagcc tcaaaatgaa gatggcacat ggcccaggca gaccttgcag tcaaaatcaa 720
gcactgcgcc tagcagaaat tttactacct ctccaaaatc atcagcaagt ccctattcct 780
cagtgtctgc ctctctcatt gcaaatggtg atagcactag cacctgtggg atgcacagtt 840
ccggtgtcag caggggtgga tctggctttt ctgcttcgta taattgtcag gagccaccat 900
cctctcatat acaacctggg ctctgtggac ttggaaacct gggaaacacc tgcttcatga 960
actccgcttt gcagtgtttg agcaacactg caccactgac tgactacttt ctcaaagatg 1020
agtatgaagc cgaaatcaac agagacaacc ctctggggat gaaaggggaa attgcagaag 1080
cctatgctga actcattaag cagatgtggt ctggaaggga tgcccatgtg gcacctcgca 1140
tgttcaaaac tcaagtagga cgttttgctc ctcaattttc tggctaccag caacaagatt 1200
ctcaggagct gctggccttt cttctagatg gattgcatga agatctgaac cgggtaaaga 1260
aaaagcccta cttggagctg aaggatgcca atgggcggcc agatgcggtg gtggcaaagg 1320
aagcctggga gaatcacagg ttgaggaatg attctgtgat tgtggatact ttccatggcc 1380
tcttcaaatc tactttggtt tgcccagaat gtgctaaggt ttctgtgacc tttgacccat 1440
tttgctatct aacgctgcca ctgcccttga agaaagatcg agttatggag gttttcctgg 1500
ttcctgctga ccctcactgc agacctactc agtaccgtgt gactgtgccg ctgatggggg 1560
ctgtgtccga cctgtgcgag gctctctcca ggctgtctgg cattgctgca gaaaatatgg 1620
tggtcgcaga tgtgtataat caccgattcc acaaaatttt ccaaatggat gaaggtttaa 1680
accacatcat gcctcgggat gacattttcg tgtacgaggt ctgcagcact tccgtggatg 1740
gctcggaatg tgtcacgctt ccagtctact tcagggagag gaagtccagg ccatcaagca 1800
cttcctccgc atcagcgcta tatgggcagc cactattgct ttctgtcccc aagcacaagt 1860
taacccttga gtctttgtac caggctgttt gtgatcgtat cagccgctat gtgaaacagc 1920
ctttacctga tgagtttggc agctcaccct tggagccagg ggcctgcaat ggctccagga 1980
acagctgtga aggagaagat gaggaagaaa tggagcatca ggaagaaggc aaagagcagc 2040
tttcagaaac agaaggcagt ggggaagatg agccaggaaa tgaccccagt gagaccaccc 2100
aaaagaagat caaaggccag ccctgcccaa aaaggctttt taccttcagt cttgtgaact 2160
cctatggaac agctgacata aattcacttg cagctgatgg aaaactactt aaactcaact 2220
ctcgatctac actggccatg gattgggaca gtgaaactcg gagactttac tatgatgagc 2280
aagaatctga ggcctacgag aagcatgtga gcatgttgca gcctcagaag aagaagaaga 2340
ccacagtggc cctgagagac tgcatcgagc tcttcaccac catggagacc cttggggagc 2400
atgacccctg gtactgtccc aactgtaaga agcatcaaca ggccacaaaa aagtttgacc 2460
tatggtcctt gcccaagatc ctggtggtcc acctcaaacg tttctcctac aacagatact 2520
ggagggataa gctcgacaca gtcgtagaat tcccaatcag agggctgaac atgtccgagt 2580
ttgtctgtaa cctgtcagca aggccttatg tgtacgacct cattgccgtg tccaatcatt 2640 
atggagccat gggggttggc cactactaaa gcagcttatg tgctatttta ccaacgtcga 2700 
gatgatgaat tttataagac accttcactt agcagttctg gttcctctga tggagggaca 2760 
cgaccaagca gctctcagca gggctttggg gatgatgagg cttgcagcat ggacaccaac 2820 
taatgctgac tccacgatcc tgccaccctg tagcgccagt gtaatccccc aggagaacat 2880 
ctttgacact ctgcagactg ctagtgttct gtctaaaaac cagacaagga aatacccttc 2940 
ttttatgagc agaaggaaac aaaaaaaaaa aagaagaccg tttacctaga agaagctatg 3000 
tcaagaggct gaattatttt tatttttaaa caggtggtga gaaatttctg tgaaacctgt 3060 
gaagctgaaa agggggtggg atgggggtac tcaatggagt atgtctgatg gatcccgaag 3120 
aatggagagg aacacaggcg ctgagtatgg agcagcctgg agacccaccc cacctgcagg 3180 
gctgccctga gcgcctggat ttctggttct gatgccaata ctagtcaccc caagtagtct 3240 
gctcacagta acccaaacct caagtaaact cccctttccc tcgctatgtg cactgatatg 3300 
ggtttataat tttctgaaag ttacccactg aagcccattt ctccgttggg tcatcttgta 3360 

<210> 93 

<211> 13855 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7507771CB1 

<400> 93 

ctccgcccct cactgcttcg ttaggttgtg cgtcatttgc ttttcgcttc gcgtagggtg 60 
aagctgtagc tacttcggct ttggtgggag ggaggagggg tctggaaagg gctgggctca 120 
ggcttttccc gtccggtaga ggttctcgcg ggatcgcgcg gaggcggcgg tggctcggtt 180 
actgactgca gtagccgcgg aggcggcggc aaggtcttcg gctcgtgccg ttctaagccg 240 
gagaattctc gcgggagcag ggttacgtcc tcgtgggatt cgttggcggt ggctgaggtc 300 
ctccagcagc ctgacctgag tgggttagtg atccagagaa accagcaggc caacttggtc 360
aggaaggttc gggaagctgt tggagcagtg tggggaattt cccaccagga tgagtatgat 420
tggctgtgat tttagatcgt aaagctgaaa attgaaatca tgaaagtaga caggactaaa 480
ctgaagaaga cacctactga ggctcctgca gactgcagag ccttaataga caaactcaaa 540 
gtttgtaatg atgagcaact tctcttggaa ctgcagcaga tcaaaacatg gaacattgga 600 
aagtgcgagt tatatcactg ggtggacctg ttggaccgct tcgatggaat actggcagat 660 
gctggacaga cagtggagaa tatgtcatgg atgctcgtat gtgataggcc agaaagagag 720
caactgaaaa tgcttctctt ggctgtgttg aacttcacag ccttgctcat tgagtacagc 780 
ttttcccggc atctgtacag ttccatagag catttgacaa ctttattggc ttcctctgat 840 
atgcaagtgg tgctggcagt cctcaatctc ctatatgtat ttagcaaaag atcaaactac 900 
atcactcgtc tgggatctga caagaggacc ccgctgctaa ctcggctaca acatttggca 960 
gagagctggg gtggaaagga gaatggcttt ggacttgcag aatgttgcag agacttgcat 1020 
atgatgaaat atccacccag tgcaactaca ctacactttg aattctatgc agatcctggg 1080 
gccgaggtca aaattgagaa aaggacaact agtaacacac tacattatat tcacatagag 1140 
caacttgaca agatttcaga aagcccttct gaaatcatgg aatctcttac caaaatgtac 1200 
agcactccta aggataagca gatgctgtta tttacacaca tacgactggc ccatggcttt 1260 
tctaatcaca ggaagcgatt gcaggcagtt caggccagac tgcatgcaat atctatatta 1320 
gtgtattcca atgccttgca ggaatcagca aacagtatct tgtataatgg cttgatagag 1380
gagttggtag atgtccttca gataacggat aagcagctta tggagattaa agcagcttct 1440 
ttacgaacat taacatcaat tgtccacttg gagagaactc ccaaactcag cagtattatt 1500 
gactgtactg gaactgcctc ctaccatgga tttttgccag tgcttgtaag gaactgtatc 1560 
caggccatga ttgatccttc catggatcca taccctcacc agtttgccac tgctctcttc 1620 
tcttttttat accatctggc cagctacgat gctggtggtg aagccttggt ctcctgtgga 1680 
atgatggaag ccttattgaa ggtcataaag tttcttggcg atgaacagga ccagataaca 1740 
tttgtcacca gagccgtcag agtggttgac cttatcacca acctggatat ggcagctttt 1800 
caatcccata gtggactttc tatcttcatt tatagacttg agcatgaagt agatttgtgc 1860 
cgaaaagaat gtccgtttgt gatcaagcca aagatccaga gacccaatac tacacaagaa 1920 
ggagaggaaa tggaaactga tatggatgga gtccagtgta ttccacaacg agcagcactt 1980 
ctgaaatcca tgttgaattt cctcaagaag gccatccaag accctgcttt ctcagatggc 2040 
atacgacatg tgatggatgg ttctctgcct acctccctga aacacatcat cagcaatgca 2100 
gaatactatg gcccatcact cttcctccta gctactgaag tggtgactgt gtttgtattt 2160
caagaaccat cactgctctc ctcactccag gacaatggat tgacagatgt catgctgcat 2220 
gcactgctta tcaaagatgt tcctgctacc cgtgaagtcc ttggctccct cccaaatgta 2280 
ttcagtgcac tctgtttgaa tgcccgaggt cttcagtctt ttgttcagtg tcagcctttt 2340
gaacgcctct tcaaagttct tctgtctcca gattacctcc cagccatgcg gaggaggaga 2400
agttctgatc cccttgggga tactgcatcc aacctgggga gtgctgtcga tgagctcatg 2460
agacatcagc ccacccttaa aacagatgca acgactgcca tcatcaagtt acttgaagaa 2520
atctgtaatc ttggaaggga ccccaaatac atctgtcaga agccatcaat ccagaaggca 2580
gatggcactg ccactgctcc tcccccaagg tctaatcatg ccgcagaaga agcctctagt 2640
gaggatgagg aggaagagga agtacaggcc atgcagagct ttaattctac ccagcaaaat 2700
gaaactgagc ctaatcagca ggttgttggt acagaggaac gtattcctat tcccctcatg 2760
gattacatcc ttaatgtgat gaaatttgtg gaatctattc tgagcaacaa tacaacagat 2820
gaccactgcc aggaatttgt gaatcagaaa ggactgttgc ctttggttac cattttgggt 2880
cttcccaatc tgcccattga ctttcccaca tctgctgcct gtcaggctgt tgcaggtgtc 2940
tgcaaatcca tattgacact gtcacatgaa cccaaagtcc ttcaagaggg tctccttcag 3000
ttggactcca tcctctcctc cctggagccc ttacaccgcc ccattgaatc ccctgggggc 3060
tcagtgttgt tgcgagaact ggcttgcgca ggcaatgttg ctgatgctac cctctcagcc 3120
caggccacac ctctgctgca tgcactcact gctgcccatg cctacatcat gatgtttgtt 3180
catacttgca gagttggaca gagtgaaatt cgttccatct ccgtaaacca gtggggctct 3240
caattgggtc tgagtgtttt gagcaagctg agccagttat actgttccct ggtgtgggaa 3300
agcactgtcc tcctctctct gtgtacccca aacagcctac catctgggtg tgaatttggc 3360
caggcagata tgcagaaact ggttccaaag gatgagaagg caggtacgac ccagggcgga 3420
aaaagatcag atggggaaca ggatggagca gctggaagta tggatgcttc tacccagggc 3480
ttattagaag gcattgggct agatggtgac acattggctc ccatggagac agatgaacct 3540
actgcttcag actctaaggg caaatctaaa atcacaccag caatggctgc cagaattaag 3600
caaatcaagc ctttgttatc agcttcctcc agattaggcc gagcacttgc tgagctattt 3660
ggacttcttg ttaaactttg tgtgggatct cctgtccgcc agagaaggag ccatcatgct 3720
gccagcacca ctacagcacc gacacctgcc gcgcgatcaa cagcctcagc tctcactaag 3780
ctcttgacta aggggttatc ttggcagccc ccaccatata cacctactcc ccgattcagg 3840
ctgacattct tcatctgttc agttggtttc acatccccaa tgctgtttga tgagaggaag 3900
tatccctacc acctcatgct gcaaaaattt ctctgctccg gaggccacaa tgctcttttt 3960
gaaactttca actgggctct gtccatggga ggtaaagttc ctgtttctga gggattggaa 4020
cactcagact tgcctgatgg cacaggagaa ttcctagatg cctggcttat gctggtggag 4080
aagatggtga atcccaccac ggtgcttgaa tctccacatt cgctgcctgc caaattgcct 4140
ggaggtgtcc agaactttcc ccagttcagt gcactgcgct tccttgtggt aactcagaaa 4200
gcagccttta cttgcatcaa aaacttatgg aaccggaaac ccctgaaggt atatggtgga 4260
cgaatggctg aatcgatgct ggccattcta tgccacatcc tccgaggaga acctgtgatt 4320
cgagagagac taagcaagga gaaggagggg tctcgaggag aagaggatac agggcaagag 4380
gaaggtggct cccgccggga acctcaagtc aaccagcaac aactgcaaca gctcatggac 4440
atgggcttca caagggaaca tgcaatggag gcactgttga acaccagcac catggagcag 4500
gccacagagt accttttaac ccaccctcct ccaatcatgg gaggagttgt tcgggatctc 4560
agcatgtctg aagaggacca gatgatgaga gcaattgcta tgtctctggg acaggatatt 4620
ccaatggatc aaagggcaga gtcacctgag gaagttgctt gccggaagga ggaagaggaa 4680
cggaaagctc gggaaaagca ggaggaggaa gaggctaaat gtctagagaa gttccaggat 4740
gctgacccgt tggaacaaga tgagctccac actttcacag atactatgtt gccaggctgc 4800
ttccaccttc ttgatgagct gccagacaca gtataccgtg tgtgtgacct gatcatgaca 4860
gcaatcaaac gtaatggagc agattatcgt gacatgattc tgaagcaagt agtcaatcag 4920
gtgtgggaag ctgctgatgt attgatcaaa gctgctcttc ccctgacaac aagtgacaca 4980
aaaaccgtgt cagagtggat aagtcagatg gccacactgc cccaggcctc caatttggct 5040
actagaatct tgcttttaac gctacttttt gaggagttga agctaccttg tgcttgggtg 5100
gttgaatcaa gtggcatcct taatgtccta atcaaactct tggaagtggt tcagccctgc 5160
ctccaggcag ccaaggagca gaaggaagtc cagaccccaa agtggatcac accagtgttg 5220
ctcctgattg atttctatga aaagacagcc atctcctcaa aaaggagagc ccagatgact 5280
aagtacctgc aatccaacag caacaactgg cgctggtttg atgatcgctc tgggcgttgg 5340
tgtagttaca gtgcaagcaa caatagcact attgattctg cctggaaatc tggagagaca 5400
agcgtgcgat tcactgcagg ccgaagaaga tacacggtcc aattcactac aatggtgcag 5460
gttaatgagg aaacagggaa ccgacgccct gtgatgctga ctctcctcag ggtacctcgg 5520
ctgaataaaa attcaaaaaa cagcaatgga caggaactag agaagacgct ggaagaaagc 5580
aaagaaatgg atatcaaacg taaagaaaat aaaggcaatg ataccccttt ggccctagag 5640
agtacaaaca ctgaaaagga gacaagcctg gaggaaacaa aaatcgggga gatcctgatc 5700
cagggcttga cagaagatat ggtgactgtt ttaatccggg cctgcgtgag catgctggga 5760
gtccctgtgg acccagatac tttgcatgcc acccttcgtc tctgtctgag gctcacccgg 5820
gaccacaaat atgccatgat gtttgcagaa ctgaagagta cccgcatgat cttgaatttg 5880
acccagagct caggcttcaa tgggtttact cccctggtca cccttctctt aagacacatc 5940
attgaggacc cctgtaccct tcgtcatacc atggaaaagg ttgttcgctc agcagctaca 6000
agtggagctg gtagcactac ctctggtgtt gtgtctggca gcctcggctc tcgggagatc 6060






aactacatcc ttcgtgtcct tgggccagcc gcatgccgca atccagacat attcacagaa 6120 
gtggccaact gctgtatccg catcgccctt cctgcccctc gaggctcagg aactgcttca 6180 
gatgatgaat ttgagaatct tagaattaaa ggccctaatg ctgtacagct ggtgaagacc 6240 
acccctttga agccctcacc tctgcctgtc atccctgata ctatcaagga agtgatctat 6300 
gatatgctga atgctctggc tgcataccat gctccagagg aagcagataa atctgatcct 6360 
aaacctgggg ttatgaccca agaggttggc cagctcctgc aagacatggg tgatgatgta 6420 
taccagcagt accggtcact tacgcgtcag agcagtgact ttgatacgca gtcaggtttt 6480 
tccattaata gtcaggtctt tgctgcagat ggtgcctcca ctgagacttc cgcatctggg 6540 
acctcccaag gagaggcttc aactccagag gagtctcgag atgggaagaa agataaagaa 6600 
ggggaccggg cctctgagga aggcaaacag aaaggcaagg gcagcaaacc tttaatgcct 6660 
acctccacta tccttcgtct tctggcagag ttggtgaggt cctatgttgg tattgctacc 6720 
ctgattgcca actacagcta cactgtgggc cagtctgaac tgatcaaaga ggactgcagt 6780 
gtgctagctt ttgttctgga ccacctgctc ccacataccc agaatgcaga agacaaggac 6840 
acccctgcct tggcccgcct gttcctcgca agcctggctg ctgcagggag tggcacagat 6900 
gcccaggtgg ccctagtgaa tgaagtaaaa gcagcccttg gacgggcact ggctatggct 6960 
gagagtacag agaaacatgc caggcttcag gcagtgatgt gtatcatcag tactatcatg 7020 
gagtcctgcc cctccacctc cagcttctac agcagtgcca cagcgaagac ccagcacaat 7080 
ggcatgaaca acatcattcg gcttttcctg aagaagggac tggttaatga cctggccaga 7140 
gtacctcaca gcttagacct gtccagtccc aacatggcca acacagtcaa tgctgctctg 7200
aagcctttgg aaacactttc ccggattgtg aaccagccca gtagcctttt tggcagcaag 7260 
agtgcttcta gcaagaacaa gtctgagcag gatgcccaag gagcctctca agattccagt 7320 
agcaaccagc aggacccagg cgagcctggg gaagcagaag tgcaggagga ggatcatgat 7380
gtcactcaga cagaggtggc agatggggat atcatggatg gggaggctga aaccgactca 7440 
gtggtgattg ctgggcagcc tgaggtgctc agttcacaag agatgcaggt tgagaatgag 7500 
ctggaggacc tgatagatga gttgcttgag agggatggcg gatctgggaa cagtacaatt 7560 
atagtgagca gaagtggaga ggatgaatca caagaggacg tgctgatgga tgaagctcct 7620 
tccaacctca gccaagcttc caccttgcag gccaaccgag aagattccat gaatatcctg 7680 
gaccctgagg atgaggagga gcacactcag gaagaggaca gcagtggcag taacgaggat 7740 
gaggatgata gtcaggatga agaggaggag gaggaggaag atgaggaaga tgatcaggag 7800 
gatgatgaag gtgaagaggg agatgaagac gatgacgacg atggctctga gatggaattg 7860 
gatgaggatt atcctgatat gaacgcttct cccttggtcc gatttgagcg ctttgaccgg 7920
gaggatgatc tcatcattga gtttgacaac atgttctcca gtgctacaga catcccccca 7980
tccccaggaa atatccctac cacccatcca ctgatggtgc gccatgcaga ccacagttct 8040 
ctgacactgg gcagtggctc ttcaacaact cgtctcaccc agggcatcgg gcgcagtcag 8100 
aggaccctaa ggcagctgac ggccaatact ggccacacca ttcatgttca ctaccctggg 8160 
aatcgccagc ccaaccctcc tcttatactg cagaggttgc ttggtccctc agctgctgct 8220 
gacatccttc agctgagcag cagccttccc ctacaaagcc ggggtcgggc ccgcctcctg 8280
gtaggcaacg atgacgtcca catcatcgcc cgttctgatg atgagctgct ggatgacttt 8340 
ttccatgatc agagcacagc taccagccaa gcaggaaccc tgtccagcat ccccacagcc 8400 
ctgacccgct ggacagaaga atgcaaagtt ctcgatgctg agagcatgca tgactgtgtt 8460
tcagtggtta aagtgtccat tgtcaatcac ctggaattcc tgagggatga ggagctggaa 8520 
gaaaggcgag agaagcgcag gaaacaactg gctgaggaag aaacaaagat aactgataaa 8580 
ggcaaagaag ataaggagaa cagggatcag agtgcacagt gtactgcatc taagtcaaat 8640 
gactccactg aacagaatct ctcagatggg acgcctatgc ctgacagcta cccaacaacc 8700 
ccatcttcaa ctgatgcagc tacatctgag tccaaggaga cccttggcac tctgcaatcc 8760 
tcacaacagc aaccaacact cccaacccca ccagctttgg gagaggttcc tcaggagctg 8820 
cagtctccag ctggagaagg gggcagctct acacagctat tgatgcctgt agagccagag 8880 
gaattgggtc ccacaaggcc aagtggggaa gcagaaacaa ctcagatgga gttatcccca 8940 
gctcccacta taacctcact ttccccagag agagctgagg attctgatgc actgacggct 9000 
gtcagcagtc agctagaagg ctctcctatg gatacaagca gcctggcttc ctgtacctta 9060 
gaggaggctg tgggtgacac ttcagcagct ggcagttctg agcagcccag agcaggcagc 9120
tccactcctg gggatgcccc accagctgtg gcggaagtgc aaggcaggag tgatgggtca 9180 
ggggaatctg cccagccacc tgaggacagc tccccacctg catcctctga gagctcttcc 9240 
accagagatt ctgccgtggc catttctgga gcagattccc gaggaatcct agaagagccg 9300 
ttgccttcaa caagcagtga agaagaagat ccccttgcgg gtatcagtct ccctgaaggt 9360 
gtggacccct cttttctggc tgccctgcct gatgacatcc gtcgggaagt tctacagaac 9420
cagctaggca ttcgtccacc aacccggact gccccctcca caaatagctc agcgcctgca 9480 
gtggtgggga atcctggtgt gactgaagtg agccctgagt ttctggctgc cctgcctcca 9540 
gccattcagg aggaagtact ggcacagcag agagctgagc agcagcgacg agaactagca 9600 
cagaatgcca gctcagacac ccctatggac cctgtgacct tcatccagac tctgccctca 9660 
gacctgcgcc gtagtgtcct agaggatatg gaggacagtg tgttagctgt gatgccacct 9720 
gacattgcag ctgaggctca agccctgaga cgagagcaag aagcccggca gcgacagctc 9780 
atgcatgagc gtctgtttgg gcacagtagc acctccgcac tctctgctat tctccgaagc 9840
ccggctttca ccagtcgctt aagtggcaac cgtggggtcc agtatactcg ccttgctgtg 9900
cagagaggtg gcaccttcca gatggggggt agcagcagcc ataacaggcc ttctggcagt 9960
aatgtagata ctctcctccg cctccgagga cggctccttc tggaccacga agccctttct 10020
tgtctcttgg tcctactttt tgtggatgag ccaaagctca atactagccg tctacaccga 10080
gtactgagaa atctctgcta ccatgcccag acccgccact gggtcatccg cagtctgctc 10140
tccatcttgc agcgcagcag tgagagtgag ctatgcattg aaacacccaa actcactaca 10200
agtgaggaaa agggcaaaaa gtcgagcaag agctgtgggt caagtagcca tgagaaccgt 10260
cccctggacc tgctacacaa gatggagtca aagagctcca accagctttc ctggctctca 10320
gtatccatgg atgcagccct aggctgcagg actaatatat ttcagatcca gcgttcaggg 10380
gggcgtaaac ataccgagaa gcatgcaagc ggtggctcca ccgtccacat ccatccccaa 10440
gctgctcctg ttgtctgcag acacgttttg gatacactca ttcaattggc caaggtattt 10500
cccagccact tcacacagca gcggaccaaa gaaacaaact gtgagagtga tcgggaaagg 10560
ggcaataagg cctgtagccc atgctcctca cagtcctcca gcagtggcat ttgcacagac 10620
ttctgggact tattggtaaa actggacaac atgaatgtca gccggaaagg caagaactcc 10680
gtgaagtcag tgccagtgag cgctggcggt gagggggaaa cctctccata cagcctcgag 10740
gcctctccac tggggcagct catgaacatg ttgtcacacc cagtcatccg ccggagctct 10800
ctcttaactg agaaactcct cagactcctt tctctcatct caattgctct cccagaaaac 10860
aaggtgtcag aagcacaggc taattctggc agcggtgctt cctccaccac cactgccacc 10920
tcaaccacat ctaccaccac caccactgcc gcctccacca cgcccacacc ccctactgca 10980
cccacccctg tcacttctgc tccagccctg gttgctgcca cggctatttc caccattgtc 11040
gtagctgctt cgaccacagt gactaccccc acgactgcta ccactactgt ttcaatttct 11100
cccactacta agggcagcaa atctccagcg aaggtgagtg atgggggcag cagcagtaca 11160
gactttaaga tggtgtcctc tggcctcact gaaaaccagc tacagctctc tgtagaggtg 11220
ttgacatccc actcttgttc tgaggaaggc ttagaggatg cagccaacgt actactgcag 11280
ctctcccggg gggactctgg gacccgggac actgttctca agctgctact gaatggagcc 11340
cgccatctgg gttataccct ttgtaaacaa ataggtaccc tgctggccga gctgcgggaa 11400
tacaacctcg agcagcagcg gcgagcccaa tgtgaaaccc tctctcctga tggcctgcct 11460
gaggagcagc cacagaccac caagctgaag ggcaaaatgc agagcaggtt tgacatggct 11520
gagaatgtgg taattgtggc atctcagaag cgacctttgg gtggccggga gctccagctg 11580
ccttctatgt ccatgttgac atccaagaca tctacccaga agttcttctt gagggtacta 11640
caggtcatca tccagctccg ggacgacacg cgccgggcta acaagaaagc caagcagaca 11700
ggcaggctag gttcctccgg tttaggctca gctagcagca tccaggcagc tgttcggcag 11760
ctggaggctg aggctgatgc cattatacaa atggtacgtg agggtcaaag ggcgcggaga 11820
cagcaacaag cagcaacgtc ggagtctagc cagtcagagg cgtctgtccg gagggaggaa 11880
tcacccatgg atgtggacca gccatctccc agtgctcaag atactcaatc cattgcctcc 11940
gatggaaccc cacaggggga gaaggaaaag gaagaaagac cacctgagtt acccctgctc 12000
agcgagcagc tgagtttgga cgagctgtgg gacatgcttg gggagtgtct aaaggaacta 12060
gaggaatccc atgaccagca tgcggtgcta gtgctacagc ctgctgtcga ggccttcttt 12120
ctggtccatg ccacagagcg ggagagcaag cctcctgtcc gagacacccg tgagagccag 12180
ctggcacaca tcaaggacga gcctcctcca ctctcccctg cccccttaac cccagccacg 12240
ccttcctccc ttgacccatt cttctcccgg gagccctcat ctatgcacat ctcctcaagc 12300
ctgccccctg acacacagaa gttccttcgc tttgcagaga ctcaccgcac tgtgttaaac 12360
cagatcctac ggcagtccac gacccacctt gctgatgggc cttttgctgt cctggtagac 12420
tacattcgtg tcctcgactt tgatgtcaag cgcaaatatt tccgccaaga gctggagcgt 12480
ttagatgagg ggctccggaa agaagacatg gctgtgcatg tccgtcgtga ccatgtgttt 12540
gaagactcct atcgtgagct gcatcgcaaa tcccccgaag aaatgaagaa tcgattgtat 12600
atagtatttg aaggagaaga agggcaggat gctggcgggc tcctgcggga gtggtatatg 12660
atcatctctc gagagatgtt taaccctatg tatgccttgt tccgtacctc acctggtgat 12720
cgagtcacct acaccatcaa tccatcttcc cactgcaacc ccaaccacct cagctacttc 12780
aagtttgtcg gacgcattgt ggccaaagct gtatatgaca accgtcttct ggagtgctac 12840
tttactcgat ccttttacaa acacatcttg ggcaagtcag tcagatatac agatatggag 12900
agtgaagatt accacttcta ccaaggtctg gtttatctgc tggaaaatga tgtctccaca 12960
ctaggctatg acctcacctt cagcactgag gtccaagagt ttggagtttg tgaagttcgt 13020
gacctcaaac ccaatggggc caacatcttg gtaacagagg agaataagaa ggagtatgta 13080
cacctggtat gccagatgag aatgacagga gccatccgca agcagttggc ggctttctta 13140
gaaggcttct atgagatcat tccaaagcgc ctcatttcca tcttcactga gcaggagtta 13200
gagctgctta tatcaggact gcccaccatt gacatcgatg atctgaaatc caacactgaa 13260
taccacaagt accagtccaa ctctattcag atccagtggt tctggagagc attgcgttct 13320
ttcgatcaag ctgaccgtgc caagttcctc cagtttgtca cgggtacttc caaggtaccc 13380
ctgcaaggct ttgctgccct cgaaggcatg aatggcattc agaagtttca gatccatcga 13440
gatgacaggt ccacagatcg cctgccttca gctcacacat gttttaatca gctggatctg 13500
cctgcctatg agagctttga gaagctccgc cacatgctac tgttggctat ccaggagtgc 13560
tctgaaggct ttgggctggc ctaataaggc cctgcccaac tccgtggggt tttttttacc 13620
attgttggac ctggggaggg gggagttaaa aaaagaacca gaaagaaatt gtcaaaaacc 13680
aataaatgaa atccaccaac tcaccgtgtg tgtcccagct gccccatctt ccccagcgca 13740 
tacctgttcc tcttctcatt ctctccccgc cgcctgtttc ctcaccttct ctcccctttc 13800 
catgccgtcc atgatcccca ccccatgtgt tttaaaaagg cagtagcctt tgcag 13855 



<210> 94 

<211> 1554 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7504732CB1 



<400> 94 

cggagactgg ccgggtagcc ccgcccccgt agttagcgcg gtgcttctct tccgctccgg 60 
gtcggctccg tttccctttc cgggcgggca ggcggcggac cccagtgtct ttatccctct 120 
tttgcacagt cagcttctgc agctctcccg ggctagcatg gcagcgtgga agagttggac 180 
ggccctgcgg ctctgcgcca cagttgttgt acttgatatg gtcgtctgta aaggatttgt 240 
agaagattta gatgaatcgt ttaaagaaaa tcgaaatgat gacatttggc ttgtagattt 300 
ttatgcgcca tggtgtggcc attgtaaaaa gctggaacca atttggaatg aagttggtct 360 
tgagatgaaa agcattggtt ctccagttaa ggttggaaag atggatgcta cttcctattc 420 
taattaaaag gggacttggc atataattat agaggaccac gaacaaaaga tgatattatt 480 
gagtttgctc acagagtatc tggggctcta attcggccac ttccaagtca acaaatgttt 540 
gaacatatgc agaagagaca ccgtgtattt ttcgtttatg taggtggaga atcacctttg 600 
aaagagaaat acatagatgc tgcttcagaa ttgattgtat atacatactt cttttctgcc 660 
tcagaagaag tggttcctga gtatgtgaca ctaaaagaga tgccagctgt gcttgttttc 720 
aaagatgaaa cttactttgt ttatgatgag tatgaagatg gtgatctgtc atcatggatc 780 
aacagggaaa ggtttcagaa ttaccttgct atggatggct tcctcttgta tgaacttgga 840 
gacacaggaa agcttgtggc tcttgcagtt attgatgaga aaaatacatc agttgaacat 900 
accagattga agtcaattat tcaggaagtt gcaagagatt acagagacct cttccatagg 960 
gattttcagt ttggccacat ggatggaaat gactacataa ataccttgct gatggatgaa 1020 
ttgacagtcc caactgtagt tgtactgaat acttcaaacc agcaatattt cttgctagat 1080
agacagatta agaatgttga agacatggtc cagtttatta ataacatttt ggatggcaca 1140 
gtagaagccc aaggaggtga tagcattttg cagagattga aaagaatagt atttgatgcc 1200 
aaatctacta ttgtgtctat attcaagagc tcaccactga tgggctgctt tctctttggc 1260 
ctgccactgg gtgtcatcag tatcatgtgc tatggaatct acacagccga cacagatgga 1320 
ggttatatag aagaacgata tgaagtgtct aaaagtgaaa atgaaaacca agaacagata 1380 
gaagagagca aagaacagca ggagcccagc agtggaggat ctgtagtgcc tacagtgcag 1440 
gagcccaagg atgtattaga aaagaagaaa gattgagact tgatgactat aaaatatttg 1500 
ttaggacttc aaattattaa agagtctatt tattgaattt agacatttaa tctg 1554 



<210> 95 

<211> 2142 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 950917CB1 



<400> 95 

cgcattttac actccctttg gtttctaccc cttcgctttt tggtgcccct ttaatctctg 60 
gaccccttaa ctcccaacgt tctgtggaaa attttttttt tttttttttt ttttttttca 120 
atcatcctct ttatagatga aatataataa cagatactag tgagattgtt gggatgatta 180 
actgcaaatg tacaaagtcc tttttaacat agtagataat tatttattat tattcctgat 240 
agacacagca cagctataag ctgactatat accaagaaca atgagtagga ttacctctta 300 
catttctttt cccaacataa tgctttgctc ctgatctgac cagggctttg ctaaagtcac 360
acatccagta agtggccgaa gcagatcttc agacccacag tttggctgcc aaaggaccac 420 
agtgagtatt tctaatgctg aaacaagagg gtgcagctct gcagcccaga cggctgggcc 480 
tcagggtagg taaccaagac agctcaatgc atctctccct cagccgtccc tagcccctaa 540 
aaacagagct gggggtccca gtcctctgtg ggacatggcc ctctttcatc accagcttcc 600 
agcctaccct gcatctcttc cctctgaatt gcaggaaccc tcccagcctc catgccgcct 660
ttcctgatca ccctcttcct ctttcactct tgctgcctcc gagcaaatgg ccacctccgt 720
gaaggaatga cattgctgaa gactgagttt gcacttcacc tctaccagag tgtggccgcg 780
tgtagaaatg agacgaactt tgtcatctct cctgctggtg tgtccctccc cctggagatc 840
ctgcagtttg gagcagaagg gagcactggt cagcagctgg cagatgccct ggggtacact 900
gtccatgaca aaagggtgaa agatttcttg catgctgttt atgccacact acccacctcc 960
agccaaggca ccgagatgga gctggcctgc agcctttttg tgcaagtggg aacgccactg 1020
tccccctgct ttgtggagca cgtctcctgg tgggctaaca gcagcctgga accagccgac 1080
ctcagtgagc ccaatagcac cgccatccag actagcgaag gggcctccag agagactgca 1140
ggtgggggcc ccagtgaggg ccctggtggc tggccgtggg agcaagtcag tgcagcattt 1200
gctcagcttg tgcttgtgag caccatgtcc ttccaaggca cttggcgaaa gagattctcc 1260
tccacagaca cacagatcct gcctttcacc tgtgcctatg gcctcgtcct tcaggtcccc 1320
atgatgcacc aaacgaccga ggtcaactac ggtcagttcc aggacactgc aggccatcag 1380
gtgggggtgc tggagcttcc ttacctggga agtgcagtga gtctgttcct ggtgctgccc 1440
cgtgacaaag acacccccct gagccacatc gagccacacc tcacagccag caccatccac 1500
ctctggacca ccagcctgag gagagccagg atggatgtgt tcctgcccag gtttaggatc 1560
caaaatcaat tcaacttaaa aagcatttta aattcttggg gagtcaccga tctttttgat 1620
ccactcaaag ctaacttgaa aggaatttca ggccaagatg gcttttatgt ttctgaagca 1680
atccacaagg ccaagattga agttttggag gaaggcacca aggcatctgg agccacagct 1740
ctgttgttat tgaaaaggtc tcggattcct atttttaaag cagatcggcc attcatctat 1800
ttcctgagag aacctaacac aggtattaca gtattttttg acaggattca gataatttat 1860
cagtgtctct ctagcaacaa gggctcattt gtccactatc ccctcaaaaa taagcattct 1920
ttctagctgt ttttatggtg cttcagaagc tacccagttg ctatcttcta tcctaagaaa 1980
ggggactatt tcgtaagtac aaaggaaact tatcctccat gttttttacc ttctcattct 2040
cctcagctca tatatagtta atgtaaattt ttttttttaa tgctataggt ttctacttct 2100
gaagacactt ggaaaaatca tgaaagtaaa ataaattata aa 2142



<210> 96

<211> 2637

<212> DNA

<213> Homo sapiens



<220> 

<221> misc_feature 

<223> Incyte ID No: 7459720CB1 

<400> 96 

ggactataga agagcccagt gtgctggaaa gggggccggt ctgtagcctt tgctgagtca 60
acactactca cagtgaaagt gatgcctttt ctccatttgc acaacatgag tgtgacgtgg 120
ttagccagac caggagctca acctcatgta gaatccagtc cactgctctg atgccgaagg 180
ggaggcagaa agtgccacac ttggatgccc ccctgggcct gcccacctgc ctctggctgg 240
aattagccgg gctcttctta ctggttccct gggtcatggg cctggcaggg acaggtgggc 300
ctgatggcca gggcacaggg gggccgagct gggctgtgca cctggaaagc ctggaaggtg 360
acggggagga agagactctg gagcagcagg cggatgcctt ggcccaggca gcagggctgg 420
tgaatgctgg acgcatcgga gagcttcagg ggcactacct ctttgtccag cctgctgggc 480
acaggccggc cctggaggtg gaggccatcc ggcagcaggt ggaggctgtg ttggctgggc 540
atgaagctgt gcgctggcac tcagagcaga ggctgctaag gcgggccaag cgcagcgtcc 600
acttcaacga ccccaagtac ccgcagcaat ggcacctgaa taaccgacgg agcccgggca 660
gggacatcaa cgtgacgggt gtgtgggaac gcaatgtgac tgggcgaggg gtgacggtgg 720
tggtagtgga tgacggagtg gaacacacca tccaggacat tgcacccaac tatagccctg 780
agggtagcta tgacctcaac tctaatgacc ctgaccccat gccccacccg gatgtggaga 840
atggcaacca ccatggcacg cgatgtgcag gagagatcgc ggctgtgccc aacaacagct 900
tctgtgccgt gggcgtggcc tacgggagcc gcatcgcagg tatccgggta ctggatggac 960
ctctcacaga cagcatggag gcagtggcgt tcaacaagca ctatcagatc aatgacatct 1020
acagctgcag gtatttattt ctcaacagct ggggaccaga tgacgatggg aagacagtgg 1080
atggccccca tcagcttgga aaggctgcct tacaacatgg ggtgattgct ggtcgccagg 1140
gctttgggag catctttgtg gtagccagtg gcaacggagg ccaacacaac gacaactgca 1200
actacgatgg ctacgccaac tccatctaca ccgtcaccat aggagctgtg gatgaggagg 1260
gacgcatgcc tttctatgca gaagaatgtg cctccatgct ggcagtcacc ttcagtggtg 1320
gggacaagat gcttcggagc attgtgacca ctgactggga ccttcagaag ggcactggct 1380
gcactgaggg ccacacaggg acctcagctg cagcgcctct ggcagctggc atgatagcct 1440
taatgctgca ggtgcggccc tgcctcacgt ggcgtgacgt ccagcacatc attgtcttca 1500
cagccacccg gtatgaggat cgccgtgcag agtgggtcac caacgaggca ggcttcagcc 1560
atagccacca gcacggtttc ggcctcctca acgcctggag gctcgtgaat gcagccaaga 1620
tctggacatc tgtcccttac ttagcatcct acgtcagtcc cgtgttaaaa gaaaacaagg 1680
cgattccgca gtccccccgt tccctggagg tcctgtggaa tgtcagcagg atggacctgg 1740
agatgtcagg gctgaagacc ctggagcatg tggcagtgac agtctccatc actcacccac 1800
ggcgcggcag cttggagctg aagctgttct gccccagtgg catgatgtcc ctcatcggcg 1860
ccccccgcag catggactcg gatcccaacg gcttcaatga ctggaccttc tccactgtgc 1920
gatgctgggg ggagagagcc cgagggacct acaggcttgt catcagggat gtcggggatg 1980
agtcattcca ggtcggcatc ctccggcaat ggcagctgac actatatggc tctgtgtgga 2040
gtgcagtaga catcagggac agacaaaggc tgttagagag tgccatgagt ggaaaatacc 2100
tgcacgatga cttcgccctg ccctgcccac cggggctgaa aattcctgag gaagatggtt 2160
acaccatcac ccgcaacacc ctcaagaccc tggtgctggt aggctgtttc accgtcttct 2220
ggactgttta ctacatgctg gaagtatatt tgagccagag gaatgtggct tccaatcaag 2280
tttgtaggag tggaccctgc cactggcccc atcggagccg gaaagccaag gaggaaggga 2340
cagagctaga atcagtgcca ctttgcagca gcaaggatcc agacgaagtg gaaacagaga 2400
gcaggggccc tcccaccacc tctgacctcc ttgccccaga cctgctggag caaggggact 2460
ggagcctgtc ccagaacaag agcgccctgg actgccctca tcagcaccta gacgtaccgc 2520
acgggaagga ggagcagatc tgctgacctc agggcctgac agtgtgggac aggctcttct 2580
ttcccaaatt agggagctct tgacagaaag cagttctgat gcttacatct ggaatct 2637



<210> 97 
<211> 2447 
<212> DNA 

<213> Homo sapiens



<220> 

<221> misc_feature 

<223> Incyte ID No: 7503300CB1 



<400> 97 

agtgcccacg tcctgagccg cgcccagact cggaagcggg ccgtcacgtg gggcggcgtg 60
ggctgctgcc ttggaaacgc ggagcgcgcg ctcccagcga aagcagcagg gcagggatct 120
gcgttggagg aagggactgc tctggtgcta gaatgctgtg cgtcggaagg ctgggcggct 180
tgggagccag agcagcagct ctgccgcccc gccgggcggg ccggggaagc ctcgaagccg 240
ggatccgggc ccgaagggtc agcaccagct ggtctcccgt gggcgccgcc ttcaatgtca 300
agccccaggg cagccgcttg gacctgttcg gcgagcgccg gggtcttttt ggagttcctg 360
agctgagtgc cccagaagga tttcatattg cacaagaaaa agccttgaga aagacagaat 420
tgcttgtgga ccgtgcatgt tccaccccac ctgggcccca gaccgtgctg atcttcgatg 480
agctctcgga ttccttatgc agagtggccg acttggctga ttttgtgaaa atcgctcacc 540
ctgagccagc attcagagaa gctgcggaag aagcttgtag aagtattggc accatggtag 600
agaagttgaa cacaaatgtg gatttatatc aaagtttgca aaaattacta gctgataaaa 660
aacttgtgga ttcccttgat ccagaaacaa ggcgagtggc tgaactgttt atgtttgatt 720
ttgaaattag tggaatccat ctagacaaag aaaagcgtaa aagagcagtg gacctcaatg 780
ttaaaatctt ggatttgagt agtacatttc ttatgggaac caattttccc aacaagattg 840
agaagcatct cttaccagaa cacattcgtc gtaactttac atctgctggg gatcatatca 900
taattgatgg tctccacgca gaatcaccag atgacttggt gcgagaagct gcttataaaa 960
tttttcttta tcccaatgct ggtcaattga aatgtttaga agaattgctc agcagcagag 1020
atcttctggc aaagttggtg gggtattcca cgttttctca cagggctctc caaggaacga 1080
tagctaaaaa tccagagact gtcatgcagt tccttgaaaa actatctgac aaactttctg 1140
aaagaactct gaaagatttt gagatgatac gagggatgaa aatgaaactg aatcctcaaa 1200
attccgtata atattgagcc cagcctatat tgcccgtttt tctctcttgg agcatgcatg 1260
gaaggcctga atattttgct taacagactg ttgaggattt cattatatgc agagcagcct 1320
gcaaaaggag aggtgtggag cgaagatgtc cgaaaactgg ctgttgttca tgaatctgaa 1380
ggattgttgg ggtacattta ctgtgatttt tttcagcgag cagacaaacc acatcaggat 1440
tgccatttca ctatccgtgg aggcagacta aaggaagatg gagactatca actcccagtt 1500
gtagttctta tgctgaatct tccccgttcc tcaaggagtt ctccaacttt gctaactcct 1560
ggcatgatgg aaaatctttt ccatgaaatg ggacatgcca tgcattcaat gctaggacgt 1620
actcgttacc aacacgtcac tgggaccagg tgccctactg attttgctga ggttccttct 1680
attctgatgg agtactttgc aaatgattat cgagtagtta accaatttgc cagacattat 1740
cagactggac agccactgcc aaaaaatatg gtgtctcgtc tttgtgaatc taaaaaggtt 1800
tgtgctgcag ctgatatgca acttcaggtc ttttatgcca ctctggatca aatctaccat 1860
gggaagcatc ccctgaggaa ttcaaccaca gacattctca aggaaacaca agagaaattc 1920
tatggcctac catatgttcc aaatactgcc tggcagctgc gattcagcca cctcgtgggg 1980
tatggtgcta gatattactc ttacctcatg tccagagcgg tcgcctccat ggtttggaag 2040
gagtgttttc tacaggatcc tttcaacagg gctgccgggg agcgctatcg cagggagatg 2100
ctggcccacg gtggaggcag ggagcccatg ctcatggttg aaggtatgct tcagaagtgt 2160
ccttctgttg atgacttcgt aagtgccctc gtttccgact tggatctgga cttcgaaact 2220
ttcctcatgg attctgaata aaagaaacac tctacacctc ttaaatcaag gtcatgtaga 2280
taatgacttt gttataaatg ctacagctgt gagagcttgt ttctgatttc attgttcgct 2340
tctgtaattc tgaaaaactt taaactggta gaacttggaa taaataattt gttttaatta 2400
aaaaaaaaaa aaaaaaaaaa aaaaaagggcg ccgctcgcga tctagac 2447



<210> 98 

<211> 2384 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7503334CB1 



<400> 98 

tatgaagaag aaactcaaaa taacaggagt ggcttatgga actacatgga ggtaacagag 60
gagggtacca accaaaggcc cttgagcaat caggatgttg gctctggcag gctgccctga 120
gcacgctcaa ccccaacccc acggacagct gtcccctcta cctgaactac gccaccgtgg 180
ctgccctgcc ctgcagggtg agccggcaca acagcccctc ggccgcccac ttcatcacgc 240
ggctggtgcg gacctgcctg ccgcccggag cgcatcgctg cattgtgatg gtctgcgagc 300
agccggaggt ctttgcttcc gcctgtgccc tggcccgggc cttcccgctg ttcacccacc 360
gctcaggtgc ctctcggcgc ttggagaaga agacggtcac cgtggagttt ttcctggtgg 420
gacaagacaa cgggccggtg gaggtgtcca cattgcagtg cttagcgaat gccacagacg 480
gcgtgcggct agcagcccgc atcgtggaca caccctgcaa tgagatgaac accgacacct 540
tcctcgagga gattaacaaa gttggaaagg agctggggat catcccaacc atcatccggg 600
atgaggaact gaagacgaga ggatttggag gaatctatgg ggttggcaaa gccgccctgc 660
atcccccagc cctggccgtc ctcagccaca ccccagatgg agccacgcag accatcgcct 720
gggtgggcaa aggcatcgtc tatgacactg gaggcctcag catcaaaggg aagactacca 780
tgccggggat gaagcgagac tgcgggggtg ctgcggccgt cctgggggcc ttcagagccg 840
caatcaagca gggtttcaaa gacaacctcc acgctgtgtt ctgcttggct gagaactcgg 900
tsgggcccaa tgcgacaagg ccagatgaca tccacctgct gtactcaggg aagacggtgg 960
aaatcaacaa cacggatgcc gagggcaggc tggtgctggc agatggcgtg tcctatgctt 1020
gcaaggacct gggggccgac atcatcctgg acatggccac cctgaccggg gctcagggca 1080
ttgccacagg gaagtaccac gccgcggtgc tcaccaacag cgctgagtgg gaggccgcct 1140
gtgtgaaggc gggcaggaag tgtggggacc tggtgcaccc gctggtctac tgccccgagc 1200
tgcacttcag cgagttcacc tcagctgtgg cggacatgaa gaactcattg gcggaccgag 1260
acaacagccc cagctcctgt gctggcctct tcatcgcctc acacatcggc ttcgactggc 1320
ccggagtctg ggtccacctg gacattgctg caccggtgca tgctggtgag cgagccacag 1380
gcttcggtgt ggccctcctg ctggcgctct tcggccgtgc ctctgaggac cctctgctga 1440
acctggtgtc cccactgggc tgtgaggtgg atgtcgagga gggggacctg gggagggact 1500
ccaagagacg caggcttgtg tgagcctcct gcctcggccc tgacaaacgg ggatctttta 1560
cctcactttg cactgattaa ttttaagcaa ttgaaagatt gcccttcata tgggttttgg 1620
tttgtctttc tggtcgtcag cgtggtggtg gaaacagctg aagttttagg agacagctta 1680
gggtttggtg cgggccacgg ggaggggacc gggaagcgct ggggcttgtt tctgtttgtt 1740
acttacagga ctgagacatc ttctgtaaac tgctacccct ggggccttct gcaccccggg 1800
gtgaggcctc ctgcctgcct ggtgccctgt cccagcccca ggtcccgtgc agggcacctg 1860
cgtggctgac agccaggctc ttactccagc cggggctgcc agcgcatcca gccagcccag 1920
ccctgtgaaa gatggagctg acttgctgca ggggacctga tttatagggc aagagaagtc 1980
acactctggc ctctcagaat tcacttgagg ttcaattaaa tacagtcaca ccgccccctc 2040
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaag 2100
agaggggggg ggagcagaaa aggaggaggc gcgcgaaccg gggggaaata taagcggaga 2160
aggagaacag agggggagca caccgagtcc cacaagaagg gggcgcgcaa ataacacacc 2220
ggtcgggcac aaatcaagcg ggcacaaaca cacggcggga ggaaatcgtg tcccacagaa 2280
cgaacgaggg ggcgacgaaa acacacacca gacgctcgcg gagaacccac aaagagggag 2340
cggcgactaa caccagaaaa agacggcgaa acaccggaaa gacg 2384



<210> 99

<211> 778

<212> DNA

<213> Homo sapiens



<220>

<221> misc_feature

<223> Incyte ID No: 7503341CB1

<400> 99

gccagtattg ccgggctatg gcggcgagca ctggctacgt gcgactgtgg ggagcggcgc 60
ggtgctgggt gctgcggcgg ccgatgctgg ccgccgccgg ggggcgggtt cccactgcag 120
caggagcgtg gttgctccga ggccagcgga cctgcgacgc ctctcctcct tgggcactgt 180
ggggccgagg cccggcaatt gggggccaat ggcgggggtt ttgggaagcg agcagccgcg 240
gcggaggcgc attctcgggg ggcgaggacg cctccgaggg cggcgcggag gaaggagccg 300
gcggcgcggg gggcagcgcg ggcgccgggg aaggcccggt cataacggcg ctcacgccca 360
tgacgatccc cgatgtgttt ccgcacctgc cgctcatcgc catcacccgc aacccggtgt 420
tcccgcgctt tatcaagatt atcgaggtta aaaataagaa gttggttgag ctgctgagaa 480
ggaaagttcg tctcgcccag ccttatgtcg gcgtctttct aaagagagat gacagcaatg 540
agtcggatgt ggtcgagagc ctggatgaaa tctaccaccc cgggactgca ggcggcggat 600
gtcaggccct gtctgggcca gaactgagcg ctgtggggag cgcgcccgga cctggcagtg 660
gagccaccga gcgagcagct cggtccagtg acccagatcc cagggacctc agtcggctta 720
atcagagtgt ggcatagaag ctatttaatg attaaagtca tttgcagtaa aaaaaaaa 778



<210> 100 

<211> 3705 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7509936CB1 



<400> 100 

gcctacggat gggaaccacg gactgacctg acctgcgtcc tcggggccac tgcactgggt 60
gattcacgtg tgcccaggcc ctgcaggtgg gccggacctg ggagtggcag ccacacccac 120
tcgcccgggt ctatggccac ggtcggagag tggtcctgtg tgcgctgcac cttcctgaac 180
ccggccggcc agcgccagtg ctccatctgc gaggctcccc ggcacaagcc cgacctcaac 240
cacatcctgc ggctcagcgt ggaggagcag aaatggccct gcgcccgctg caccttccgc 300
aacttcctgg gcaaggaggc ctgcgaggtg tgcggcttca ccccggagcc tgcgcctggg 360
gctgccttcc tgccagtcct caacggggtc ctccccaagc cacccgccat cctgggggag 420
cccaagggca gctgccagga ggaagcaggt ccagtgagga ctgcggggct ggtggccacg 480
gagcccgcca gggggcagtg cgaggacaag gacgaggagg agaaggagga gcaggaggag 540
gaggagggag cggcggagcc cagagggggc tgggcgtgtc cgcgttgcac gctgcacaac 600
acgcccgtgg ccagctcctg ctccgtctgc gggggcccac gcaggctctc gctgccacgg 660
atccctcctg aggccctggt ggtcccggaa gtggtggccc cggccggctt ccacgtcgtg 720
cctgccgcgc ctccacctgg cctccccggg gaaggtgccg aggccaaccc cccagccacc 780
agccagggcc cagctgccga accagagccg cccagggtcc cgcccttcag ccccttctcg 840
tccaccctgc agaacaaccc cgtgccgcgc agccgacgcg aggttccccc ccagctgcag 900
ccaccggtgc ctgaggctgc ccagccgtca ccctctgccg gctgcagggg agccccccag 960
ggctcgggct gggctggggc ctcccgccta gcagagttgc tgtctggcaa gcggctgagt 1020
gtgctggagg aagaggccac ggagggtggc accagccgcg tagaggccgg cagctccacc 1080
tcgggcagtg acatcattga cctggccgga gacaccgtgc gttacacgcc cgccagcccc 1140
tccagccccg acttcaccac ctggtcatgt gccaagtgca cgctcagaaa ccccacagtg 1200
gcccccaggt gctcggcctg cggctgctcc aaactgcacg gcttccagga gcatggcgag 1260
ccccccaccc actgccccga ctgtggggcc gacaagccca gcccctgcgg cagaagctgc 1320
ggacgggtgt cctcggccca gaaggccgcc cgcgtcctgc ccgagcgccc gggccagtgg 1380
gcctgccctg cctgtaccct gctcaacgca ctgcgggcca agcactgcgc cgcctgccac 1440
acgcctcagc tcctggtggc ccagcggcgg ggggccgcgc ccctgaggcg cagggagagc 1500
atgcacgtgg agcagcggcg gcagacagac gagggcgagg ccaaggcact ctgggagaac 1560
atcgtggcct tctgccggga gaacaatgtg agcttcgtgg atgacagctt ccctcccggg 1620
cccgagtctg tcggcttccc cgcgggtgac agcgtgcagc agcgtgtgag gcagtggctg 1680
cgaccccagg agatcaactg ctccgtcttc agggaccaca gggccacgtg gtctgtgttc 1740
cacacactgc ggccctcaga catcctgcag gggctgctgg ggaactgctg gttcctgagc 1800
gccctggcgg tgctggcgga gcggccggac ctggtggagc gggtgatggt cacgcgcagc 1860
ctgtgtgcag agggcgccta ccaggtgcgg ctgtgcaagg acggcacgtg gaccacggtg 1920
ctggtggacg acatgctgcc ctgtgatgag gccggctgcc tcctcttctc acaggcgcag 1980
cggaagcagc tgtgggtggc cctcatcgag aaggcgctgg ccaagctgca cggctcctac 2040
tttgcgctcc aggcgggccg cgccatcgaa ggcctggcca cgctcaccgg cgccccctgt 2100
gagagcctgg cgctgcagct cagctccact aacccccgcg aggagcccgt tgacactgac 2160
ctcatctggg ccaaaatgct gagttctaag gaggctgggt tcctcatggg tgcctcctgt 2220
ggcgggggca acatgaaggt ggacgattcg gcctacgaga gcctgggcct gcgcccccgg 2280
catgcctact ccatcctgga tgtccgagat gtccagggca ccaggcttct gcggctccga 2340
aacccgtggg gccgtttctc ctggaacggc agctggtccg acgagtggcc acactggccg 2400
gggcacctgc gtggcgagct catgccgcac ggcagcagtg agggtgtctt ctggatggag 2460
tacggcgact ttgtcaggta cttcgactcc gtggacatct gtaaggtgca ctcggactgg 2520
caggaggcgc gggtgcaggg ctgctttccc agctcggcca gcgcgcccgt gggggtaaca 2580
gcgctcacgg tgctggagcg ggcctcgctg gagttcgcgc tcttccagga gggcagcagg 2640
cgctcggacg ccgtggacag ccacctgctg gacctgtgca tcctggtgtt ccgggccacg 2700
ttcggcagcg gcggccacct cagcctgggc cgcctcctgg cccacagtaa gcgcgcggtc 2760
aagaagttcg tcagctgcga cgtcatgctg gagcctggcg agtacgctgt ggtgtgctgc 2820
gccttcaacc actgggggcc gcccctgccg ggcacccctg ccccccaggg tacgtggccc 2880
ctaccccagg ctcatgcccc aggcccacgg ggagggctgc ggttcacacg cccgtccttg 2940
tagcctccag cccctcggca ggggtcccga gagcctcccc agagccgccg ggccacgtgc 3000
tggctgtgta cagctcgagg ctggtcatgg tggagcccgt ggaagcccag ccgaccacgc 3060
tggccgacgc catcatcctg ctcaccgaga gccgcggaga gcggcacgag ggccgtgagg 3120
gcatgacctg ctactacctg acacacggtt gggcggggct catcgtggtg gtggagaacc 3180
gacaccccaa ggcctacctg cacgtgcagt gtgactgcac cgacagcttc aacgtggtgt 3240
ccacacgcgg cagcctgcgt acccaggata gcgtgccacc cctgcacagg caggtcctgg 3300
tgatcttgtc ccagctagag ggcaacgccg gcttctctat cacccaccgc ctggcacatc 3360
gcaaggcagc ccaggccttc ctcagtgact ggacagcctc caaggggacc cacagccccc 3420
cgctcacgcc agaggtcgcc ggtctgcatg ggccccgacc gctgtgacca ccatgcctgg 3480
ggcaggggct gtgcacagac ggacccccca cccccacacg cactttatga gggagacccc 3540
gacagaggac gcttgagcca gaatctcagg accccgccca gggagctgcc agcccagggc 3600
tcagcttccc atgggccccc cgcccccctc cttcccctcc ctgaacccca cagtccgtct 3660
ggcacaggcc tcctggccgc cacgcagaat acctcgaacc aggtg 3705



<210> 101 

<211> 3872 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7509986CB1 



<400> 101 

cccgcgcgtg ccgcggcatg gcggccgtgc 
cgttgctggc cctgtgccct ggagggcgcc 
aggagttcct cgagggcttc ctgtcagagc 
aggccccgcc gcctcccgag cccaccccgc 
cagggaagcg gccagggacg gccgcagaag 
agaaaggcaa gaaagacaaa ggccccaagg 
ggccgcccaa gaaggggaag gagaagccac 
cacctaaggc caccaagaag cccaaggaga 
agaagccacc caaggccacc aagaagcccc 
cctcagaaac cctggagtgg ccactgcccc 
cccaggaggg aggggcgccc ctctcaaata 
tggaggcacg ggagcaccag cctgagccgg 
acaatgacca gatcgagagg gaggactatg 
aacccaggcc acccccaagc agaaggagga 
aggagaaggc cccggcccca gccccggagg 
tgcccccgct gccccctgac tatggtgatg 
actattactt tgggcctcct ccgccccaga 
agaaggagga gctgaagaaa cccaaaaagg 
agtgggcagt ggagaagggc aaggaccaca 
aggagtggac gcctacggag aaagtcaagt 
ttgaggacaa ccagatccga gcctcctcca 
gccggctcaa catgcagacc ggtgccactg 
ccgaggacga tgccaggacc cagtggatag 
caggcgtcat cacccagggc agagactcca 



gcggggcgcc cctgctcagc tgcctcctgg 60 
cgcagacggt gctgaccgac gacgagatcg 120 
tagaacctga gccccgggag gacgacgtgg 180 
gggtccgaaa agcccaggcg gggggcaagc 240 
tgcctccgga aaagaccaaa gacaaaggga 300 
tgcccaagga gtccttggag gggtccccca 360 
ccaaggccac caagaagccc aaggagaagc 420 
agccacccaa ggccaccaag aagcccaaag 480 
cgtcagggaa gaggcccccc attctggctc 540 
caccccccag ccctggcccc gaggagctac 600 
actggcagaa tccaggagag gagacccatg 66 0 
aggaggagac cgagcaaccc acactggacfc 72 0 
aggactttga gtacattcgg cgccagaagc 78 0 
ggcccgagcg ggtctggcca gagacccctg 840 
agaggattga gcctcctgtg aagcctctgc 900 
gttacgtgat ccccaactac gatgacatgg 960 
agcccgatgc tgagcgccag acggacgaag 1020 
aggacagcag ccccaaggag gagaccgaca 1080 
aagagccccg aaagggcgag gagttggagg 1140 
gtccccccat tgggatggag tcacaccgta 1200 
tgctgcgcca cggcctgggg gcacagcgcg 1260 
aggacgacta ctatgatggt gcgtggtgtg 1320 
aggtggacac caggaggact acccggttca 1380 
gcatccatga cgattttgtg accaccttct 1440 
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tcgtgggctt 
tgacctttca 
tggtggctcg 
tggaggtgct 
tggccaccga 
aggtggtgaa 
gaggcctcaa 
agcccgagtt 
tgctgctgct 
gcctggtgca 
tggcagcgca 
gctttgacat 
ggaaatgggt 
cgccagatgc 
ccttcgtgct 
tggcccgcac 
aggatgagga 
ttgccatctc 
aagcccagga 
ccgggactat 
tgggctgtga 
aggcgctgct 
ccagtggtgg 
cggagggcta 
agtgcaactt 
acgggaaccg 
gcctgcagca 
gcctcaacgc 
ctgccaccac 
gctgggagga 
tggagcccga 
ttgagaccca 
tagccactgg 
acttctgaga 
tcccaagcct 
acactgaaag 
agagccagag 
gctgcagtgt 
catggcaaaa 
aaaaaaaaaa 

ggggcggcgc 



cagcaatgac 
tgggaacgtg 
tttcatccgc 
ggggtgctct 
tgacctggat 
cgaggagtgc 
gatctatgcc 
ccgctacact 
catgcagtac 
ggacacacgc 
gatgggctca 
ctttgaagat 
cccctaccgg 
cacggtatcc 
gg^agcaaat 
gcctacccag 
cgaggtctcc 
cttcgcctcc 
ctacaccggc 
caatgacttc 
caagttccct 
caccttcatg 
tgattactgg 
caccccgagc 
catcctggct 
gcctatccca 
gcgacgccta 
caccaccacc 
cctgagcact 
gtcggagact 
gtttgggacc 
gctggaaccc 
ccaggcattc 
tcagcgtcct 
gctgaccaca 
gaagggctgg 
gctgtgtaga 
tggagtaggg 
aaaaaaaaaa 
aaaaaaaaaa 
gtgggtgggc 



agccagacat 
gacaaggaca 
atctacccac 
gtggcccctg 
ttccggcacc 
cccaccatca 
atggagatct 
gctgggatcc 
ctgtgccgag 
atccacctgg 
gagtttggga 
ttcccggatc 
gtccccaaca 
acggaggtcc 
ctgaacggcg 
gagcagctgc 
gaggcccagg 
gcacacctca 
ggcatgggca 
agttacctgc 
catgagagtg 
gagcaggtgc 
cgaatcttga 
gccaagacct 
cgctccaact 
cacatagacc 
caacaccgcc 
ctaggccccc 
accatagagc 
gagacctaca 
aaggtggagc 
gagtttgagg 
cccttcacaa 
accaagaccc 
gtcacatcac 
tcctgcccct 
ggctcctgct 
gcagagggag 
aaaaaaaaaa 
aaaaaaaaaa 
ccccgcgcgg 



gggtgatgta 
cacccgtgct 
tcacctggaa 
tctacagcta 
acagctacaa 
cccgcactta 
cagacaaccc 
atggcaacga 
agtaccgcga 
tgccctcact 
actgggcgct 
tcaactctgt 
afcaacttgcc 
gggccatcat 
gcgagcggct 
tggccgcagc 
agactccaga 
ccttgaccga 
tcgtcaacgg 
ataccaactg 
agctgccccg 
accgtggcat 
acccgggtga 
gcaatgttga 
ggaagcgcat 
catcgcgccc 
tgcggcttcg 
acactgtgcc 
cctggggcct 
cagaggtggt 
ccgagtttga 
aagaggagga 
cagtagagac 
cagcccaact 
ccatcagcac 
ttgagggggt 
ccacctgcca 
ggagccaagg 
aaaaaaaaaa 
aaaaaaaaaa 

gg 



caccaacggc 
gagtgagctc
tggcagcctg 
ctacgcacag 
ggacatgcgc 
cagcctgggc 
tggggagcat 
ggtgctgggc 
tgggaaccca 
gaaccctgat 
gggactgtgg 
gctctgggga 
catccctgaa 
tgcctggatg 
agtatcctac 
catggcagca 
ccacgccatc 
gccctaccgc 
ggccaagtgg 
cctggagctc 
cgagtgggag 
taatcacggc 
gtaccgcgtg 
ctatgacatc 
ccgggagatc 
tatgaccccc 
ggcacagatg 
tcccacgctg 
cataccgcca 
gacagagttt 
gacccagttg 
ggaggagaaa 
ctacacagtg 
caagctacag 
atggaaggcc 
gcaaacatga 
gtctcgtaag 
tcactccaat 
aaaaaaaaaa 
aaaaaaaaaa 



tatgaggaaa 
ccagagccgg 
tgcatgcgcc 
aatgaggtgg 
cagctcatga 
aagagctcac 
gaactggggg 
cgagagctgt 
cgtgtgcgca 
ggctacgagg 
actgaggagg 
gctgaggaga 
cgctaccttt 
gagaagaacc 
ccctacgata 
gcccgggggg 
ttccggtggc 
ggaggctgcc 
aacccccgga 
tccttctacc 
aacaacaagg 
gtgaagacag 
acagcccacg 

ggggccactc 

atggccatga 
caacagcgac 
cggctgcggc 
ccccctgccc 

accaccgctg 
gggaccgagg 
gagcctgagt 
gaggaggaga 
aactttgggg 
cagcagcact 
cctggtatgg 
ctgggaccta 
agatggggtt 
aaaacaagct 
aaaaaaaaaa 
aaaagaaggg 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3872 



<210> 102 

<211> 2545 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7510010CB1 



<400> 102 

cggctcgagg tgagtttggc cactccgtag tgtgcacttg gtgagggcag cagctcgcca 60
cagctgccag ccatctgtcc attcacccat ctgtccatct ggcagcccgc tgttcagacc 120
tgtctgtctg tccgcccatc tgtaagccca tctctgtccc attgtctatc tgaccatctt 180
tctcttactg tcctctttgt ctagctatct ggcctatctg tcgatccatc ttcgtgtctg 240
tcttcagccc ccacctgttt gtccatctgt ccaattacct gtgactctgt gcatcttctt 300
gtccattcat ctgcccaccc atccgtccct ccgtctgccc accagccgcc cctctcctcc 360
tgggctgcag agccatggcc cggggctacg gggccacggt cagcctagtc ctgctgggtc 420
tggggctggc gctggctgtc attgtgctgg ctgtggtcct ctctcgacac caggccccat 480
gtggccccca ggcctttgcc cacgctgctg ttgccgccga ctccaaggtc tgctcggata 540
ttggacgagc catcctccag cagcagggct cacccgtgga tgccaccatc gcggctctgg 600
tctgcaccag cgtcgtcaac cctcagagca tgggcctggg cggaggggtc atcttcacca 660
tctacaatgt gacaacaggg aaggtggagg tcatcaatgc ccgggagacg gtgccggcca 720
gccacgcccc gagcctgctg gaccagtgtg cacaggctct gccactgggc acaggggccc 780 
agtggatcgg ggtgcccggg gagctccgtg gctatgccga ggcccaccgc cgtcatggcc 840 
gcctgccctg ggcgcagctg ttccagccca ccatcgcgct gctccgaggg gggcatgtgg 900 
tggcccctgt cctcagccgt ttcctgcaca acagcatcct gcggccttcc ttgcaggcgt 960 
caaccctgcg ccagctcttc ttcaacggga cagaacccct gaggcctcag gacccactcc 1020
catggcctgc actggccacc accctggaga ccgtggccac agagggcgtg gaggtcttct 1080 
acacggggag gctgggccag atgctggtgg aggacattgc caaggaaggg agccagctga 1140 
cgctgcagga cctggccaag ttccagcccg aggtggtgga tgccctggag gtgcccctgg 1200 
gggactatac cctgtactca ccaccgccgc ctgcaggggg tgccattctc agctttatcc 1260 
tcaacgtgct aagagggttc aacttctcaa cagagtctat ggccaggcct gaagggaggg 1320 
tgaacgtgta ccaccacctt gtagagacgc tcaagtttgc cagggggcag aggtggaggc 1380 
tgggggaccc tcgaagccac ccgaagctcc agaatgcctc ccgggacctg ctgggggaga 1440 
ccctggccca gctcatccgc caacagatcg atggccgggg ggaccaccag ctcagccact 1500 
acagcttggc cgaggcctgg ggccacggga caggcacgtc ccatgtgtct gtgctggggg 1560
aggatggcag cgccgtggct gccaccagca ccatcaacac accctttgga gcgatggtgt 1620 
attcaccacg gacaggcatc atcctcaaca acgagctcct ggacttatgc gagcgatgcc 1680
cccggggttc cggcaccacc ccctcacctg tgagtggaga cagggtgggt ggagctcccg 1740 
gaaggtgctg gcccccagtt ccaggcgagc gttccccatc ctccatggtg ccctccatct 1800 
tgatcaacaa agcccagggg tcgaagctag tgattggcgg ggctggcggg gagctcatca 1860
tctctgctgt ggcccaggcc atcatgagca agctgtggct tggctttgac ctgagagcgg 1920 
ccattgcagc ccccatcctg catgtcaaca gcaagggctg tgtggagtac gagcccaact 1980
tcagccagaa acaggatctt gctgcattgc ccaggctggt ctcaaactcc tggcctcaag 2040 
tgatcctcct tcctcagcct cccaaattgc tggcattaca ggaggtgcag aggggactcc 2100 
aagaccgtgg ccagaaccag acccagaggc ccttcttcct gaacgtggtc caggctgtgt 2160 
cccaggaggg ggcctgtgtg tacgccgtct cggacctgag gaagagtggg gaggccgcag 2220 
gctactaaga cactgctctg cccagagctg aagtctggcc ccaccatgag tcctgtgtcc 2280 
aggccggaca tggctggggg accaactact ctggcaggat ctggacccct ggcaggggag 2340 
tccagctgag agtggaagag gtggcgggga ccagctgggc agatgagagg ctgagcctca 2400 
tccctaaccc cctttcccag agcccctggt ggtcctgaac cggcccctct atccctccgc 2460 
aggcctcttg cctggggcca ctctcccacc ctctcgatct gtatatcctc cagtccaaga 2520 
ttaaagaggc ggactgtgaa aaaaa 2545 

<210> 103 

<211> 3273 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7510056CB1 

<400> 103 

gaggactatg aggactgtga gtagggtcct gccagcccca cctgggtcgg acccctggcc 60
tgggggatgt gccaatgggc ccatcccagc cttgggcccc actctgagcc agcctccccc 120
tcagttgagt acattcggcg ccagaagcaa cccaggccac ccccaagcag aaggaggagg 180
cccgagcggg tctggccaga cccccctgag gagaaggccc cggccccagc cccggaggag 240
aggattgagc ctcctgtgaa gcctctgctg cccccgctgc cccctgacta tggtgatggt 300
tacgtgatcc ccaactacga tgacatggac tattactttg ggcctcctcc gccccagaag 360
cccgatgctg agcgccagac ggacgaagag aaggaggagc tgaagaaacc caaaaaggag 420
gacagcagcc ccaaggagga gaccgacaag tgggcagtgg agaagggcaa ggaccacaaa 480
gagccccgaa agggcgagga gttggaggag gagtggacgc ctacggagaa agtcaagtgt 540
ccccccattg ggatggagtc acaccgtatt gaggacaacc agatccgagc ctcctccatg 600
ctgcgccacg gcctgggggc acagcgcggc cggctcaaca tgcagaccgg tgccactgag 660
gacgactact atgatggtgc gtggtgtgcc gaggacgatg ccaggaccca gtggatagag 720
gtggacacca ggaggactac ccggttcaca ggcgtcatca cccagggcag agactccagc 780
atccatgacg attttgtgac caccttcttc gtgggcttca gcaatgacag ccagacatgg 840
gtgatgtaca ccaacggcta tgaggaaatg gtgggcacca tgcccaggct cttggctctg 900
ctcccattgt ctgggcgagg ggtgggctct cagaggggct ggcagtactg ctctgaggcc 960
tgcctctccc cagacctttc atgggaacgt ggacaaggac acacccgtgc tgagtgagct 1020
cccagagccg gtggtggctc gtttcatccg catctaccca ctcacctgga atggcagcct 1080
gtgcatgcgc ctggaggtgc tggggtgctc tgtggcccct gtctacagct actacgcaca 1140
gaatgaggtg gtggccaccg atgacctgga tttccggcac cacagctaca aggacatgcg 1200
ccagctcatg aaggtggtga acgaggagtg ccccaccatc acccgcactt acagcctggg 1260
caagagctca cgaggcctca agatctatgc catggagatc tcagacaacc ctggggagca 1320
tgaactgggg gagcccgagt tccgctacac tgctgggatc catggcaacg aggtgctggg 1380
ccgagagctg ttgctgctgc tcatgcagta cctgtgccga gagtaccgcg atgggaaccc 1440
acgtgtgcgc agcctggtgc aggacacacg catccacctg gtgccctcac tgaaccctga 1500
tggctacgag gtggcagcgc agatgggctc agagtttggg aactgggcgc tgggactgtg 1560
gactgaggag ggctttgaca tctttgaaga tttcccggat ctcaactctg tgctctgggg 1620
agctgaggag aggaaatggg tcccctaccg ggtccccaac aataacttgc ccatccctga 1680
acgctacctt tcgccagatg ccacggtatc cacggaggtc cgggccatca ttgcctggat 1740
ggagaagaac cccttcgtgc tgggagcaaa tctgaacggc ggcgagcggc tagtatccta 1800
cccctacgat atggcccgca cgcctaccca ggagcagctg ctggccgcag ccatggcagc 1860
agcccggggg gaggatgagg acgaggtctc cgaggcccag gagactccag accacgccat 1920
cttccggtgg cttgccatct ccttcgcctc cgcacacctc accttgaccg agccctaccg 1980
cggaggctgc caagcccagg actacaccgg cggcatgggc atcgtcaacg gggccaagtg 2040
gaacccccgg accgggacta tcaatgactt cagttacctg cataccaact gcctggagct 2100
ctccttctac ctgggctgtg acaagttccc tcatgagagt gagctgcccc gcgagtggga 2160
gaacaacaag gaggcgctgc tcaccttcat ggagcaggtg caccgcggca ttaagggggt 2220
ggtgacggac gagcaaggca tccccattgc caacgccacc atctctgtga gtggcattaa 2280
tcacggcgtg aagacagcca gtggtggtga ttactggcga atcttgaacc cgggtgagta 2340
ccgcgtgaca gcccacgcgg agggctacac cccgagcgcc aagacctgca atgttgacta 2400
tgacatcggg gccactcagt gcaacttcat cctggctcgc tccaactgga agcgcatccg 2460
ggagatcatg gccatgaacg ggaaccggcc tatcccacac atagacccat cgcgccctat 2520
gaccccccaa cagcgacgcc tgcagcagcg acgcctacaa caccgcctgc ggcttcgggc 2580
acagatgcgg ctgcggcgcc tcaacgccac caccacccta ggcccccaca ctgtgcctcc 2640
cacgctgccc cctgcccctg ccaccaccct gagcactacc atagagccct ggggcctcat 2700
accgccaacc accgctggct gggaggagtc ggagactgag acctacacag aggtggtgac 2760
agagtttggg accgaggtgg agcccgagtt tgggaccaag gtggagcccg agtttgagac 2820
ccagttggag cctgagtttg agacccagct ggaacccgag tttgaggaag aggaggagga 2880
ggagaaagag gaggagatag ccactggcca ggcattcccc ttcacaacag tagagaccta 2940
cacagtgaac tttggggact tctgagatca gcgtcctacc aagaccccag cccaactcaa 3000
gctacagcag cagcacttcc caagcctgct gaccacagtc acatcaccca tcagcacatg 3060
gaaggcccct ggtatggaca ctgaaaggaa gggctggtcc tgcccctttg agggggtgca 3120
aacatgactg ggacctaaga gccagaggct gtgtagaggc tcctgctcca cctgccagtc 3180
tcgtaagaga tggggttgct gcagtgttgg agtaggggca gagggaggga gccaaggtca 3240
ctccaataaa acaagctcat ggcaaaaaaa



<210> 104 
<211> 1409 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7510398CB1 



<400> 104 

gccagggacc aagatggatc ttctcctcga catcagctaa gcctggagga ctcttcccct 60
cagagaccat ggagagggac agccacggga atgcatctcc agcaagaaca ccttcagctg 120
gagcatctcc agcccaggca tctccagctg ggacacctcc aggccgggca tctccagccc 180
aggcatctcc agcccaggca tctccagctg ggacacctcc gggccgggca tctccagccc 240
aggcatctcc agctggtaca cctccaggcc gggcatctcc aggccgggca tctccagccc 300
aggcatctcc agcccgggca tctccggctc tggcatcact ttccaggtcc tcatccggca 360
ggtcatcatc cgccaggtca gcctcggtga caacctcccc aaccagagtg taccttgtta 420
gagcaacacc agtgggggct gtacccatcc gatcatctcc tgccaggtca gcaccagcaa 480
ccagggccac cagggagagc ccaggtacga gcctgcccaa gttcacctgg cgggagggcc 540
agaagcagct accgctcatc gggtgcgtgc tcctcctcat tgccctggtg gtttcgctca 600
tcatcctctt ccagttctgg cagggccaca cagggatcag gtacaaggag cagagggaga 660
gctgtcccaa gcacgctgtt cgctgtgacg gggtggtaga ctgcaagctg aagagtgacg 720
agctgggctg cgtgaggttt gactgggaca agtctctgct taaaatctac tctgggtcct 780
cccatcagtg gcttcccatc tgtagcagca actggaatga ctcctactca gagaagacct 840
gccagcagct gggtttcgag agtgctcacc ggacaaccga ggttgcccac agggattttg 900
ccaacagctt ctcaatcttg agatacaact ccaccatcca ggaaagcctc cacaggtctg 960
aatgcccttc ccagcggtat atctccctcc agtgttccca ctgcggactg agggccatga 1020
ccgggcggat cgtgggaggg gcgctggcct cggatagcaa gtggccttgg caagtgagtc 1080
tgcacttcgg caccacccac atctgtggag gcacgctcat tgacgcccag tgggtgctca 1140
ctgccgccca ctgcttcttc gtgacccggg agaaggtcct ggagggctgg aaggtgtacg 1200
cgggcaccag caactcatat ccgggcccca aagcttctgc agggcagaag tcaaagactc 1260
ttaaagatcc ttacatggaa cacttctgtt ttataattag ggaaactgaa gcccaagggt 1320
tataaataag tttgctccaa atgacacatc agctaagcct ctcctcgaca tcagctaagc 1380
ctgaataagt ttgctccaaa tgacacatc 1409

<210> 105
<211> 4025
<212> DNA

<213> Homo sapiens



<220> 

<221> misc_feature

<223> Incyte ID No: 7510498CB1 



<400> 105 

tctcccgccc cttcctggat tccctcaccc gtctcgatcc cctctccgcc ctttcccaga 60
gacccagagc ccctgacccc ccgcgccctc cccggagccc cccgcgcgtg ccgcggccat 120
ggcggccgtg cgcggggcgc ccctgctcag ctgcctcctg gcgttgctgg ccctgtgccc 180
tggagggcgc ccgcagacgg tgctgaccga cgacgagatc gaggagttcc tcgagggctt 240
cctgtcagag ctagaacctg agccccggga ggacgacgtg gaggccccgc cgcctcccga 300
gcccaccccg cgggtccgaa aagcccaggc ggggggcaag ccagggaagc ggccagggac 360
ggccgcagaa gtgcctccgg aaaagaccaa agacaaaggg aagaaaggca agaaagacaa 420
aggccccaag gtgcccaagg agtccttgga ggggtccccc aggccgccca agaaggggaa 480
ggagaagcca cccaaggcca ccaagaagcc caaggagaag ccacctaagg ccaccaagaa 540
gcccaaggag aagccaccca aggccaccaa gaagcccaaa gagaagccac ccaaggccac 600
caagaagccc ccgtcaggga agaggccccc cattctggct ccctcagaaa ccctggagtg 660
gccactgccc ccacccccca gccctggccc cgaggagcta ccccaggagg gaggggcgcc 720
cctctcaaat aactggcaga atccaggaga ggagacccat gtggaggcac gggagcaccg 780
gcctgagccg gaggaggaga ccgagcaacc cacactggac tacaatgacc agatcgagag 840
ggaggactat gaggactttg agtacattcg gcgccagaag caacccaggc cacccccaag 900
cagaaggagg aggcccgagc gggtctggcc agagacccct gaggagaagg ccccggcccc 960
agccccggag gagaggattg agcctcctgt gaagcctctg ctgcccccgc tgccccctga 1020
ctatggtgat ggttacgtga tccccaacta cgatgacatg gactattact ttgggcctcc 1080
tccgccccag aagcccgatg ctgagcgcca gacggacgaa gagaaggagg agctgaagaa 1140
acccaaaaag gaggacagca gccccaagga ggagaccgac aagtgggcag tggagaaggg 1200
caaggaccac aaagagcccc gaaagggcga ggagttggag gaggagtgga cgcctacgga 1260
gaaagtcaag tgtcccccca ttgggatgga gtcacaccgt attgaggaca accagatccg 1320
agcctcctcc atgctgcgcc acggcctggg ggcacagcgc ggccggctca acatgcagac 1380
cggtgccact gaggacgact actatgatgg tgcgtggtgt gccgaggacg atgccaggac 1440
ccagtggata gaggtggaca ccaggaggac tacccggttc acaggcgtca tcacccaggg 1500
cagagactcc agcatccatg acgattttgt gaccaccttc ttcgtgggct tcagcaatga 1560
cagccagaca tgggtgatgt acaccaacgg ctatgaggaa atggtgggca ccatgcccag 1620
gctcttggct ctgctcccat tgtctgggcg aggggtgggc tctcagaggg gctggcagta 1680
ctgctctgag gcctgcctct ccccagacct ttcatgggaa cgtggacaag gacacacccg 1740
tgctgagtga gctcccagag ccggtggtgg ctcgtttcat ccgcatctac ccactcacct 1800
ggaatggcag cctgtgcatg cgcctggagg tgctggggtg ctctgtggcc cctgtctaca 1860
gctactacgc acagaatgag gtggtggcca ccgatgacct ggatttccgg caccacagct 1920
acaaggacat gcgccagctc atgaaggtgg tgaacgagga gtgccccacc atcacccgca 1980
cttacagcct gggcaagagc tcacgaggcc tcaagatcta tgccatggag atctcagaca 2040
accctgggga gcatgaactg ggggagcccg agttccgcta cactgctggg atccatggca 2100
acgaggtgct gggccgagag ctgttgctgc tgctcatgca gtacctgtgc cgagagtacc 2160
gcgatgggaa cccacgtgtg cgcagcctgg tgcaggacac acgcatccac ctggtgccct 2220
cactgaaccc tgatggctac gaggtggcag cgcagatggg ctcagagttt gggaactggg 2280
cgctgggact gtggactgag gagggctttg acatctttga agatttcccg gatctcaact 2340
ctgtgctctg gggagctgag gagaggaaat gggtccccta ccgggtcccc aacaataact 2400
tgcccatccc tgaacgctac ctttcgccag atgccacggt atccacggag gtccgggcca 2460
tcattgcctg gatggagaag aaccccttcg tgctgggagc aaatctgaac ggcggcgagc 2520
ggctagtatc ctacccctac gatatggccc gcacgcctac ccaggagcag ctgctggccg 2580
cagccatggc agcagcccgg ggggaggatg aggacgaggt ctccgaggcc caggagactc 2640
cagaccacgc catcttccgg tggcttgcca tctccttcgc ctccgcacac ctcaccttga 2700
ccgagcccta ccgcggaggc tgccaagccc aggactacac cggcggcatg ggcatcgtca 2760
acggggccaa gtggaacccc cggaccggga ctatcaatga cttcagttac ctgcatacca 2820
actgcctgga gctctccttc tacctgggct gtgacaagtt ccctcatgag agtgagctgc 2880
cccgcgagtg ggagaacaac aaggaggcgc tgctcacctt catggagcag gtgcaccgcg 2940
gcattaaggg ggtggtgacg gacgagcaag gcatccccat tgccaacgcc accatctctg 3000
tgagtggcat taatcacggc gtgaagacag ccagtggtgg tgattactgg cgaatcttga 3060
acccgggtga gtaccgcgtg acagcccacg cggagggcta caccccgagc gccaagacct 3120
gcaatgttga ctatgacatc ggggccactc agtgcaactt catcctggct cgctccaact 3180
ggaagcgcat ccgggagatc atggccatga acgggaaccg gcctatccca cacatagacc 3240
catcgcgccc tatgaccccc caacagcgac gcctgcagca gcgacgccta caacaccgcc 3300
tgcggcttcg ggcacagatg cggctgcggc gcctcaacgc caccaccacc ctaggccccc 3360
acactgtgcc tcccacgctg ccccctgccc ctgccaccac cctgagcact accatagagc 3420
cctggggcct cataccgcca accaccgctg gctgggagga gtcggagact gagacctaca 3480
cagaggtggt gacagagttt gggaccgagg tggagcccga gtttgggacc aaggtggagc 3540
ccgagtttga gacccagttg gagcctgagt ttgagaccca gctggaaccc gagtttgagg 3600
aagaggagga ggaggagaaa gaggaggaga tagccactgg ccaggcattc cccttcacaa 3660
cagtagagac ctacacagtg aactttgggg acttctgaga tcagcgtcct accaagaccc 3720
cagcccaact caagctacag cagcagcact tcccaagcct gctgaccaca gtcacatcac 3780
ccatcagcac atggaaggcc cctggtatgg acactgaaag gaagggctgg tcctgcccct 3840
ttgagggggt gcaaacatga ctgggaccta agagccagag gctgtgtaga ggctcctgct 3900
ccacctgcca gtctcgtaag agatggggtt gctgcagtgt tggagtaggg gcagagggag 3960
ggagccaagg tcactccaat aaaacaagct catggcacgg acaaagaaaa gatgacaaaa 4020
aaacg 4025



<210> 106 
<211> 2951 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7510044CB1 

<400> 106 

gatggctacg gcagggggtg gctctggggc tgacccggga agtcggggtc tccttcgcct 60
tctgtctttc tgcgtcctac tagcaggcct cttcctggac ttcgagcctg accctctccc 120
actttgtcca gatgtctcgt ttttcccaca accccagcca cccaccccac agtcgaaagg 180
aacaggtttg tgcaggggaa actcagtgga gaggaagata tatatcccct taaataaaac 240
agctccctgt gttcgcctgc tcaacgccac tcatcagatt ggctgccagt cttcaattag 300
tggagacaca ggggttatcc acgtagtaga gaaagaggag gacctacagt gggtattgac 360
tgatggcccc aacccccctt acatggttct gctggagagc aagcatttta ccagggattt 420
aatggagaag ctgaaaggga gaaccagccg aattgctggt cttgcagtgt ccttgaccaa 480
gcccagtcct gcctcaggct tctctcctag tgtacagtgc ccaaatgatg ggtttggtgt 540
ttactccaat tcctatgggc cagagtttgc tcactgcaga gaaatacagt ggaattcgct 600
gggcaatggt ttggcttatg aagactttag tttccccatc tttcttcttg aagatgaaaa 660
tgaaaccaaa gtcatcaagc agtgctatca agatcacaac ctgagtcaga atggctcagc 720
accaaccttc ccactatgtg ccatgcagct cttttcacac atgcatgctg tcatcagcac 780
tgccacctgc atgcggcgca ggtccatcca aagcaccttc agcatcaacc cagaaatcgt 840
ctgtgacccc ctgtctgatt acaatgtgtg gagcatgcta aagcctataa atacaactgg 900
gacattaaag cctgacgaca gggttgtggt tgctgccacc cggctggata gtcgttcctt 960
tttctggaat gtggccccag gggctgaaag cgcagtggct tcctttgtca cccagctggc 1020
tgctgctgaa gctttgcaaa aggcacctga tgtgaccacc ctgccccgca atgtcatgtt 1080
tgtcttcttt caaggggaaa cttttgacta cattggcagc tcgaggatgg tctacgatat 1140
ggagaagggc aagtttcccg tgcagttaga gaatgttgac tcatttgtgg agctgggaca 1200
ggtggcctta agaacttcat tagagctttg gatgcacaca gatcctgttt ctcagaaaaa 1260
tgagtctgta cggaaccagg tggaggatct cctggccaca ttggagaaga gtggtgctgg 1320
tgtccctgct gtcatcctca ggaggccaaa tcagtcccag cctctcccac catcttccct 1380
gcagcgattt cttcgagctc gaaacatctc tggcgttgtt ctggctgacc actctggtgc 1440
cttccataac aaatattacc agagtattta cgacactgct gagaacatta atgtgagcta 1500
tcccgaatgg ctgagccctg aagaggacct gaactttgta acagacactg ccaaggccct 1560
ggcagatgtg gccacggtgc tgggacgtgc tctgtatgag cttgcaggag gaaccaactt 1620
cagcgacaca gttcaggctg atccccaaac ggttacccgc ctgctctatg ggttcctgat 1680
taaagccaac aactcatggt tccagtctat cctcaggcag gacctaaggt cctacttggg 1740
tgacgggcct
acagtatgcc 
ggatccaagt 
ccctttgcat 
agccagggcc 
atggactgag 
gcttgagttg 
ctactgcatc 
gtcatactga 
ctgtcccact 
ttgggattaa 
cctcccttcc 
tcatgccaga 
cccaccctaa 
ctgctcctca 
tttaatgtca 
ctactgtctc 
tgctgcgggt 
gtaatatcta 
gagtttcatt 
tagaactagt 



cttcaacatt 
ttggcaaatt 
aaagtcccaa 
tctaatgaga 
ttgtctcctg 
agccgctgga 
atcaccctga 
aatgccaaag 
ggaggacccc 
gggacacaac 
cataaaagag 
tccgctcccc 
ttttgggatt 
tttgcccttc 
cccccacccc 
gggtcaaact 
cttcccactg 
gggtatccca 
tttttgtttt 
aaaatagatt 



acatcgctgt 
tgactggcac 
gtgaaaacaa 
cggaccgact 
cctttgaact 
aagatatccg 
cagtgggctt 
ctgatgtcct 
agcttttctt 
cactaatttg 
tggaactatc 
tttcccatca 
acaaatagaa 
aggacccttc 
tgtacccagc 
acattgagcc 
tcctttctcc 
cctccagccc 
tgtctttttc 
atcccacaaa 



ctccagcccc 
agtggtcaac 
ggatctgtat 
cccccggtgt 
gagtcagtgg 
tgcccggata 
cggcatcctc 
tttcattgct 
gccagctcag 
tcactggaac 
caaaagagac 
ccccttcccc 
gcttcttgct 
tactttttcc 
caccttcctg 
cctgaggaca 
aggccctcag 
acagtgctca 
ctttattctt 
aaaaaaaaaa 



accaacacca 
ctcacccgag 
gagtactcat 
gtgcgttcta 
agctctactg 
tttctcatcg 
atcttctccc 
ccccgggagc 
cagttcactt 
ctccctgggc 
agggagaaat 
atttcctctt 
cctgtttaac 
ttcctgccct 
actgggaagg 
ggggcatctc 
atggcacatt 
gttgtacttt 
tttgtaaata 
aagggcggcc 



cttatgttgt 
agcagtgcca 
gggtccaggg 
ctgcacgatt 
aatactctac 
ccagcaaaga 
tcatcgtcac 
caggagctgt 
cctagagcat 
ctgtctcaga 
aaataaattg 
ccttctctac 
tccctagtta 
gtacctctct 
acataaaagg 
tgggctgagc 
agggtgggcg 
ttattaagct 
tatatataat 
gctcgcgatc 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
2951 



<210> 107 
<211> 1832 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature
<223> Incyte ID No: 7504509CB1



<400> 107 

cccacgcgtc 

cctgaagggg 

ggggcgggcc 

ggtggccatg 

caagttcgtg 

ctgagaaatg 

tcatagcttt 

cccacacgga 

tccagcaagt 

acctgactct 

agctggtgca 

gaaatatcaa 

cagccatcca 

atgagcggca 

aggacatagt 

cctatgatga 

tgcaggcctt 

gcatggtcac 

cactgctgac 

tcgaggaagc 

ccaactacct 

tcaaggtgaa 

aggtggccaa 

gtggaaccac 

gcagccccca 

agaccctcac 

tagtggattg 

gttctcagct 

acgcttattt 

aaagttgggg 

tttaaatctc 



cgggcggcgc 
cacgcggggg 
gagcccggag 
aacggtaagg 
aaccggagtc 
gaatattaag 
tgctgtaggg 
cagcccctgc 
cggtgtggag 
ggctggacgc 
cgtggagcgg 
cgagaacttt 
ggaggagctg 
ccattcggtc 
ggagatggag 
gttcatcttt 
gatagattcc 
actctatgac 
agagctggtg 
catacccaag 
ggacaagcat 
cagcaagcaa 
caaagtcaag 
cattggacct 
actggccatg 
cctcttcaag 
agccctcttg 
gagctgaagc 
ggagactaga 
agcaggtgga 
tgaaaaaaaa 



ggccggggaa 
acctgcaaag 
gccagatgag 
cccgcaaaga 
cctctccttt 
cccgagagca 
ggccagtacg 
ctccgggtga 
acctatggtg 
gtcattgtca 
cccattcttc 
gggcccaaca 
gagaagggga 
ctcatgtccc 
ctctgccttg 
gctcctcggc 
tgtgcaggcc 
aacgaagagg 
ctgcggcgga 
tccttcatga 
gaggagaacc 
cgctatgctt 
gtccccctgc 
atcttggctt 
cactctatcc 
ggcttctttg 
gaaagacttc 
tggattatta 
ggagtgggag 
ggaggccaca 
aaaaagaaaa 



gcgtgatgaa 
ctagtgaggg 
cggacacagc 
ggcggtgcag 
ccatggctgg 
agtacttcat 
ttcctggcaa
aacgtcggtc 
gtgggatctg 
agtgccctac 
gcatcccaca 
cagagatgca 
ctcctgagcc 
tgctctgtgc 
cagacaccca 
tggacaatct 
ctggctccct 
tggggtctga 
tctcagcctc 
tcagcgcaga 
accggccttt 
caaacgcggt 
aggatctcat 
ctcggctggg 
gggagatggc 
agctgttccc 
tctgccatcc 
aagtggattg 
ttgagcctgg 
ctcctgggag 
aa 



ggcctacgag 
gcggggcagg 
cccacgcgcg 
actgcggcta 
cttcagtgaa 
gaccaggaac 
tggcttcagc 
tcgccgcagc 
gagcacctgg 
ctcaggtcgg 
cctggccatc 
tctagtcccc 

agggcctctc 

ccatctgggg 
gcctgcggtc 
gcacagctgc 
ggccacagag 
gagtgcacag 
gtgccagcac 
catggcccat 
attccacaag 
gtcagaggcc 
ggtccggaat 
gctgcgggtg 
ctgcaccaca 
ttctctaagc 
ctttgcacct 
tcactcagac 
cttgaacctt 
ctgatggttt 



tgcggcgcgg 
cggcgcggtg 
gggccatgca 
aggaactcct 
ctcaaggaga 
tcctccacca 
ctcatcgggg 
caggtgggct 
tttgaccgtg 
ctggagcagc 
catctgcagc 
attcttgcca 
aatgctgtgg 
ctgagcccca 
ttgggtggtg 
ttctgtgccc 
cctcacgtgc 
ggagcacagt 
ccgacagcct 
gctgtgcatc 
ggccccgtga 
ctgatccgag 
gacaccccct 
ctggatttag 
ggagtcctcc 
cataatctct 
gagaggggaa 
tctccgtgct 
tggaaccaga 
taaatctggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1832 
<210> 108
<211> 1471 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7506825CB1 



<400> 108 

cgacatccgg 

caggcgacct 

gcgtgagccc 

cccgaggagg 

cagcccaagc 

caccaaccct 

caaggaacat 

tgccaggggt 

gctgaaggtg 

ggaggttggg 

gagggcaggc 

ttatagtgag 

ctcacgcttc 

ggcattccgg 

cgtgacctcc 

caccatgagc 

ggagcagctg 

gaagtggatg 

tagccgggtc 

tgacaaccgc 

cacacctgtg 

ggtggacata 

cagctgagcc 

aggacctcct 

ctaacacaaa 



cctcggccac 
cgctggaccc 
agcgacagga 
agcacccatc 
ctgactatgg 
acactctcct 
tggagtcacg 
gcccaggaca 

gagggccacc 

ggtcaccaag 
tttgccctgg 
cggagtccct 
atggaggaca 
gagaaggaat 
gtgaacctga 
gccagctttg 
cagagctggt 
cacccccaag 
tgcaaggata 
tatatccgcg 
ctgctgcacg 
tatacacgcc 
ctggaactcc 
cttccccctt 
aaaaaaaaaa 



gtggtgggcg 
taagtccagg 
gagtgagctc 
ggtgacgctc 
tggcacctgg 
ccatcttgct 
acccctttga 
tgaagtgcgt 
ggttccccag 
gcatggagct 
atgagggcat 
ggtgggtgcg 
cagcagcaga 
ggcagaggct 
ctaagctaga 
acttccgtgt 
gccaggcagc 
tgacacctac 
tgaacctcac 
cggtgggggt 
accacgatga 
tgctgcctgc 
taaacctttg 
ccaaataata 
aaaaaggggc 



gaccggggcg 
ccacagtcag 
accacgcgca 
ttccgccagt 
ctatgtggtg 
caactcccac 
ggccttcaag 
cagcatccag 
aaccatccac 
gttcgtgcag 
agccaatccc 
ggttaccagc 
gaagctgcac 
gcagtcaaac 
gggtggcgtg 
ggcaccggat 
tggcgagggg 
tgatgactca 
tctggagcct 
cccagctcta 
acggctgcat 
ccttgccagt 
cccctggggc 
aagtctatgg 

g 



gtcctgagcc 
ggaagggcgc 
gcgccatgac 
acctgcgtat 
accgtgttga 
acggatgtgg 
gattctgagg 
tacctggaag 
atgacctttg 
cggcctgagt 
actgatgcct 
actgggaggc 
aaggttgtaa 
ccccacctga 
gcctataacg 
gtggacttca 
gtcaccctag 
aacccttggt 
gagatcatgc 
ggcttctcac 
gaggctgtgt 
gtgcctgccc 
ttccatccca 
acagggctgt 



tgcgacctcg 
tgagaggcga 
cagcaagggt 
ccgcactgtc 
cctggccagg 
tgcctgtctt 
gctacatcta 
ctgtgaggag 
tgcctgatga 
tccacgccct 
tcactgtctt 
caggccatgc 
actccatcct 
aagaggggtc 
tgatacctgc 
aggcttttga 
agtttgctca 
gggcagcttt 
ctgctgccac 
ccatgaaccg 
tcctccgtgg 
tgcccagtga 
accagtgcca 
ctctgaagta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140

1200

1260 

1320 

1380 

1440 

1471 



<210> 109 
<211> 1344 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7506828CB1 



<400> 109 

cgacatccgg cctcggccac gtggtgggcg gaccggggcg gtcctgagcc tgcgacctcg 60 

caggcgacct cgctggaccc taagtccagg ccacagtcag ggaagggcgc tgagaggcga 120 

gcgtgagccc agcgacagga gagtgagctc accacgcgca gcgccatgac cagcaagggt 180 

cccgaggagg agcacccatc ggtgacgctc ttccgccagt acctgcgtat ccgcactgtc 240 

cagcccaagc ctgactatgg agctgctgtg gctttctttg aggagacagc ccgccagctg 300

ggcctgggct gtcagaaagt agaggtggca cctggctatg tggtgaccgt gttgacctgg 360

ccaggcacca accctacact ctcctccatc ttgctcaact cccacacgga tgtggtgcct 420 

gtcttcaagg aacattggag tcacgacccc tttgaggcct tcaaggattc tgagggctac 480 

atctatgcca ggggtgccca ggacatgaag tgcgtcagca tccagtacct ggaagctgtg 540 

aggaggctga aggtggaggg ccaccggttc cccagaacca tccacatgac ctttgtgcct 600 

gatgaggagg ttgggggtca ccaaggcatg gagctgttcg tgcagcggcc tgagttccac 660

gccctgaggg caggctttgc cctggatgag ggcatagcca atcccactga tgccttcact 720 
gtcttttata gtgagcggag tccctggtgg gtgcgggtta ccagcactgg gaggccaggc 780 

catgcctcac gcttcatgga ggacacagca gcagagaagc tggcttttga ggagcagctg 840 

cagagctggt gccaggcagc tggcgagggg gtcaccctag agtttgctca gaagtggatg 900 

cacccccaag tgacacctac tgatgactca aacccttggt gggcagcttt tagccgggtc 960 

tgcaaggata tgaacctcac tctggagcct gagatcatgc ctgctgccac tgacaaccgc 1020 
tatatccgcg cggtgggggt cccagctcta ggcttctcac ccatgaaccg cacacctgtg 1080
ctgctgcacg accacgatga acggctgcat gaggctgtgt tcctccgtgg ggtggacata 1140 
tatacacgcc tgctgcctgc ccttgccagt gtgcctgccc tgcccagtga cagctgagcc 1200 
ctggaactcc taaacctttg cccctggggc ttccatccca accagtgcca aggacctcct 1260 
cttccccctt ccaaataata aagtctatgg acagggctgt ctctgaagta ctaacacaag 1320 
gacaaaaaaa aaaaaaaaaa aaaa 1344 



<210> 110 
<211> 1429 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7506831CB1 



<400> 110 

cgacatccgg 

caggcgacct 

gcgtgagccc 

cccgaggagg 

cagcccaagc 

acggatgtgg 

gattctgagg 

tacctggaag 

atgacctttg 

cggcctgagt 

actgatgcct 

actgggaggc 

aaggttgtaa 

ccccacctga 

gcctataacg 

gtggacttca 

gtcaccctag 

aacccttggt 

gagatcatgc 

ggcttctcac 

gaggctgtgt 

gtgcctgccc 

ttccatccca 

acagggctgt 



cctcggccac 
cgctggaccc 
agcgacagga 
agcacccatc 
ctgactatgg 
tgcctgtctt 
gctacatcta 
ctgtgaggag
tgcctgatga 
tccacgccct 
tcactgtctt 
caggccatgc 
actccatcct 
aagaggggtc 
tgatacctgc 
aggcttttga 
agtttgctca 
gggcagcttt 
ctgctgccac 
ccatgaaccg 
tcctccgtgg 
tgcccagtga 
accagtgcca 
ctctgaagta 



gtggtgggcg 
taagtccagg 
gagtgagctc 
ggtgacgctc 
caccaaccct 
caaggaacat 
tgccaggggt 
gctgaaggtg 
ggaggttggg 
gagggcaggc 
ttatagtgag 
ctcacgcttc 
ggcattccgg 
cgtgacctcc 
caccatgagc 
ggagcagctg 
gaagtggatg 
tagccgggtc 
tgacaaccgc 
cacacctgtg 
ggtggacata 
cagctgagcc 
aggacctcct 
ctaacacaag 



gaccggggcg 
ccacagtcag 
accacgcgca 
ttccgccagt 
acactctcct 
tggagtcacg 
gcccaggaca 

gagggccacc 

ggtcaccaag 
tttgccctgg 
cggagtccct 
atggaggaca 
gagaaggaat 
gtgaacctga 
gccagctttg 
cagagctggt 
cacccccaag 
tgcaaggata 
tatatccgcg 
ctgctgcacg 
tatacacgcc 
ctggaactcc 
cttccccctt 
gacaaaaaaa 



gtcctgagcc 
ggaagggcgc 
gcgccatgac 
acctgcgtat 
ccatcttgct 
acccctttga 
tgaagtgcgt 
ggttccccag 
gcatggagct 
atgagggcat 
ggtgggtgcg 
cagcagcaga 
ggcagaggct 
ctaagctaga 
acttccgtgt 
gccaggcagc 
tgacacctac 
tgaacctcac 
cggtgggggt 
accacgatga 
tgctgcctgc 
taaacctttg 
ccaaataata 
aaaaaaaaa 



tgcgacctcg 
tgagaggcga 
cagcaagggt 
ccgcactgtc 
caactcccac 
ggccttcaag 
cagcatccag 
aaccatccac 
gttcgtgcag 
agccaatccc 
ggttaccagc 
gaagctgcac 
gcagtcaaac 
gggtggcgtg 
ggcaccggat 
tggcgagggg 
tgatgactca 
tctggagcct 
cccagctcta 
acggctgcat 
ccttgccagt 
cccctggggc 
aagtctatgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1429 



<210> 111 
<211> 2187 
<212> DNA

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7509968CB1 



<400> 111 

gctgacgctc ctcgtgggcc cctcactcag gccccgcccg ccctgccctc ccctcccgat 60 
cccggagcca tgtggcccct ggccctagtg atcgcctccc tgaccttggc cttgtcagga 120 
ggtgtctccc aggagtcttc caaggttctc aacaccaatg ggaccagtgg gtttctccca 180 
ggtggctaca cctgcttccc ccactctcag ccctggcagg ctgccctact agtgcaaggg 240 
cggctactct gtgggggagt cctggtccac cccaaatggg tcctcactgc cgcacactgt 300 
ctaaaggagg ggctcaaagt ttacctaggc aagcacgccc tagggcgtgt ggaagctggt 360 
gagcaggtga gggaagttgt ccactctatc ccccaccctg aataccggag aagccccacc 420 
cacctgaacc acgaccatga catcatgctt ctggagctgc agtccccggt ccagctcaca 480 
ggctacatcc aaaccctgcc cctttcccac aacaaccgcc taacccctgg caccacctgt 540 
cgggtgtctg gctggggcac caccaccagc ccccagggta tgcacccaca caggtggcct 600 
gaggccccat aggagtggct ggggaaacag gggcagagat gggagggaag gtctgagtga 660 
attaccccaa
aagtctaccc 
aagactcctg 
tcgtctcctg 
tctcaagata 
aatggttgaa 
gacttcctca 
accagtgatc 
attcaggcac 
cctgaatgtc 
tgagcccaga 
cccatctcaa 
caagtattta 
agactcccat 
caacattctg 
ttctcacact 
aataccccac 
atagggttca 
gtcttgtatc 
atgaccctaa 
gtgctttctc 
ataaatgtca 
aaaaaaaaaa 
cgctagaaat 
ctagagaggg 
ggggcaggca 



aactctacaa 
aggaaagatc 
tgagggtgac 
gggagacttc 
cgtcctgtgg 
gggcccacaa 
catggtctgc 
catgtcctga 
tgccaggctt 
ccaacccaga 
tcaccacctg 
caatatatgt 
ttcaatgcca 
gatagttcat 
tgtacctctg 
gttatgggtg 
ctctgccaac 
gtataccaaa 
ccctccatga 
aatgtagtgc 
atctgtaaaa 
agcatttatc 
aaaaaaaaaa 
gagacaccct 
ggcagaggga 
ccacctcccc 



tgtgccaaca 
actgacaaca 
tctgggggcc 
ccatgtgggc 
atccgtgaaa 
taaaagttga 
ttagcccttc 
aaaatgctca 
gcagtctccc 
caatggccca 
aacgttctgt 
cctataaatg 
gtatcctaga 
cctgaaaata 
tgactcacca 
tctcagaagt 
attccctctc 
gggtcttctt 
acatccttgg 
aaatccttgc 
tgggttcata 
actaaaaaaa 
aaaaaaaaaa 
agcccccgcg 
acactataca 
taatgtg 



tccaacttcg 
tgttgtgtgc 
ccctggtctg 
aacctgaccg 
caatccgaaa 
gaaatgtacc 
tctgctcctt 
atctcagcta 
agatgttgca 
ggtctctcaa 
ctgtggcaca 
gaccatcctt 
ccttctattt 
ttctcttatg 
cagctaacat 
cccaacccaa 
tgaatcaatg 
gcctgaactt 
tcagtgtccc 
tttggacaag 
atttttttta 
aaaaaaaaaa 
aaaaaacaca 
aggacataat 
ccccgcatgt 



ctcagatgag 
cggcacaaaa 
taacagaaca 
gcctggtgtc 
atatgaaacc 
ggcttccatc 
attcccagtg 
acattccatg 
tccctgaaac 
cttcatcagt 
ttcttaaata 
gacaacatcc 
tttgcactca 
cccacaatct 
tggatcctca 
cctacatccc 
gcaccctagt 
tattgtctac 
ttgctgttac 
ttataaaact 
attgtaacat 
aaaaaaaaaa 
aaacaggggg 
atcgccggca 
ataacgcgca 



gagtgtcgtc 
gagggtggca 
ctgtatggca 
tacacccgtg 
cagcagcaaa 
ctgtcaccat 
ttccatttga 
tttcagaagc 
atctcaacaa 
gtggcttcta 
tttccatcag 
tctaactctt 
agaaggctct 
tctgccctga 
gaatatttca 
acattcttcc 
ctctagagtt 
caaatattcc 
atctttgtgc 
cacagtctct 
tattacaaga 
aaaaaaaaaa 
ggggcggcgg 
gcgagcacca 
cacacagagg 



720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2187 



<210> 112 
<211> 2020 
<212> DNA 

<213> Homo sapiens

<220> 

<221> misc_feature

<223> Incyte ID No: 7510232CB1 



<400> 112 

tgaaggctgg ttgttcagac tgagcttcct gcctgcctgt accccgccaa cagcttcaga 60
agaaggtgac tggtggctgc ctgaggaata ccagtgggca agagaattag catttctgga 120
gcatctgctg tctgagcagc ccctgggtgc gtccactttc tgggcacgtg aggttgggcc 180
ttggccgcct gagcccttga gttggtcact tgaaccttgg gaatattgag attatattct 240
cctgcctttt aaaaagatgg acttcagcag aaatctttat gatattgggg aacaactgga 300
cagtgaagat ctggcctccc tcaagttcct gagcctggac tacattccgc aaaggaagca 360
agaacccatc aaggatgcct tgatgttatt ccagagactc caggaaaaga gaatgttgga 420
ggaaagcaat ctgtccttcc tgaaggagct gctcttccga attaatagac tggatttgct 480
gattacctac ctaaacacta gaaaggagga gatggaaagg gaacttcaga caccaggcag 540
ggctcaaatt tctgcctaca ggttccactt ctgccgcatg agctgggctg aagcaaacag 600
ccagtgccag acacagtctg tacctttctg gcggagggtc gatcatctat taataagggt 660
catgctctat cagatttcag aagaagtgag cagatcagaa ttgaggtctt ttaagtttct 720
tttgcaagag gaaatctcca aatgcaaact ggatgatgac atgaacctgc tggatatttt 780
catagagatg gagaagaggg tcatcctggg agaaggaaag ttggacatcc tgaaaagagt 840
ctgtgcccaa atcaacaaga gcctgctgaa gataatcaac gactatgaag aattcagcaa 900
aggggaggag ttgtgtgggg taatgacaat ctcggactct ccaagagaac aggatagtga 960
atcacagact ttggacaaag tttaccaaat gaaaagcaaa cctcggggat actgtctgat 1020
catcaacaat cacaattttg caaaagcacg ggagaaagtg cccaaacttc acagcattag 1080
ggacaggaat ggaacacact tggatgcagg tacagtagaa cccaaaagag aaaagtaaaa 1140
tatttcttat gcctattttt ttttaaatca aaagggagag aacaaaagct ataccaaaag 1200
ggccatgttt caagaaaatg gatttaaaca tatttccctg tggaggggct ttgaccacga 1260
cctttgaaga gcttcatttt gagatcaagc cccacgatga ctgcacagta gagcaaatct 1320
atgagatttt gaaaatctac caactcatgg accacagtaa catggactgc ttcatctgct 1380
gtatcctctc ccatggagac aaaggcatca tctatggcac tgatggacag gaggccccca 1440
tctatgagct gacatctcag ttcactggtt tgaagtgccc ttcccttgct ggaaaaccca 1500
aagtgttttt tattcaggct tgtcaggggg ataactacca gaaaggtata cctgttgaga 1560
ctgattcaga ggagcaaccc tatttagaaa tggatttatc atcacctcaa acgagatata 1620
tcccggatga ggctgacttt ctgctgggga tggccactgt gaataactgt gtttcctacc 1680
gaaaccctgc agagggaacc tggtacatcc agtcactttg ccagagcctg agagagcgat 1740
gtcctcgagg cgatgatatt ctcaccatcc tgactgaagt gaactatgaa gtaagcaaca 1800
aggatgacaa gaaaaacatg gggaaacaga tgcctcagcc tactttcaca ctaagaaaaa 1860
aacttgtctt cccttctgat tgatggtgct attttgtttg ttttgttttg ttttgttttt 1920
ttgagacaga atctcgctct gtcgcccagg ctggagtgca gtggcgtgat ctcggctcac 1980
cgcaagctcc gcctcccggg ttcaggccat tctcctgcct 2020



<210> 113 
<211> 2167 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7510233CB1 



<400> 113 

tgaaggctgg ttgttcagac tgagcttcct gcctgcctgt accccgccaa cagcttcaga 60 
agaaggtgac tggtggctgc ctgaggaata ccagtgggca agagaattag catttctgga 120 
gcatctgctg tctgagcagc ccctgggtgc gtccactttc tgggcacgtg aggttgggcc 180 
ttggccgcct gagcccttga gttggtcact tgaaccttgg gaatattgag attatattct 240 
cctgcctttt aaaaagatgg acttcagcag aaatctttat gatattgggg aacaactgga 300 
cagtgaagat ctggcctccc tcaagttcct gagcctggac tacattccgc aaaggaagca 360 
agaacccatc aaggatgcct tgatgttatt ccagagactc caggaaaaga gaatgttgga 420 
ggaaagcaat ctgtccttcc tgaaggagct gctcttccga attaatagac tggatttgct 480 
gattacctac ctaaacacta gaaaggagga gatggaaagg gaacttcaga caccaggcag 540 
ggctcaaatt tctgcctaca ggttccactt ctgccgcatg agctgggctg aagcaaacag 600 
ccagtgccag acacagtctg tacctttctg gcggagggtc gatcatctat taataagggt 660
catgctctat cagatttcag aagaagtgag cagatcagaa ttgaggtctt ttaagtttct 720
tttgcaagag gaaatctcca aatgcaaact ggatgatgac atgaacctgc tggatatttt 780 
catagagatg gagaagaggg tcatcctggg agaaggaaag ttggacatcc tgaaaagagt 840 
ctgtgcccaa atcaacaaga gcctgctgaa gataatcaac gactatgaag aattcagcaa 900 
agagagaagc agcagcctgg aaggaagtcc tgatgaattt tcaaatgggg aggagttgtg 960 
tggggtaatg acaatctcgg actctccaag agaacaggat agtgaatcac agactttgga 1020 
caaagtttac caaatgaaaa gcaaacctcg gggatactgt ctgatcatca acaatcacaa 1080 
ttttgcaaaa gcacgggaga aagtgcccaa acttcacagc attagggaca ggaatggaac 1140 
acacttggat gcaggggctt tgaccacgac ctttgaagag cttcattttg agatcaagcc 1200 
ccacgatgac tgcacagtag agcaaatcta tgagattttg aaaatctacc aactcatgga 1260 
ccacagtaac atggactgct tcatctgctg tatcctctcc catggagaca agggcatcat 1320 
ctatggcact gatggacagg aggcccccat ctatgagctg acatctcagt tcactggttt 1380 
gaagtgccct tcccttgctg gaaaacccaa agtgtttttt attcaggctt gtcaggggga 1440 
taactaccag aaaggtatac ctgttgagac tgattcagag gagcaaccct atttagaaat 1500 
ggatttatca tcacctcaaa cgagatatat cccggatgag gctgactttc tgctggggat 1560 
ggccactgtg aataactgtg tttcctaccg aaaccctgca gagggaacct ggtacatcca 1620 
gtcactttgc cagagcctga gagagcgatg tcctcgaggc gatgatattc tcaccatcct 1680 
gactgaagtg aactatgaag taagcaacaa ggatgacaag aaaaacatgg ggaaacagat 1740 
gcctcagcct actttcacac taagaaaaaa acttgtcttc ccttctgatt gatggtgcta 1800 
ttttgtttgt tttgttttgt tttgtttttt tgagacagaa tctcgctctg tcgcccaggc 1860 
tggagtgcag tggcatgatc tcggctcacc gcaagctccg cctcccgggt tcaggccatt 1920 
ctcctgcctc agcctcccga gtagctggga ctacaggggc ccgccaccac acctggctaa 1980 
ttttttaaaa atatttttag tagagacagg gtttcactgt gttagccagg gtggtcttga 2040 
tctcctgacc tcgtgatcca cccacctcgg cctcccaaag tgctgggatt acaggcgtga 2100 
gccaccgcgc ctggccgatg gtactattta gatataacac tatgtttatt tactaatttt 2160 
ctagatt 2167 



<210> 114 
<211> 1706 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> misc_feature

<223> Incyte ID No: 7510304CB1



<400> 114 

ctcgagccga ttcggctcga gcccggcact acctcgaggc tccgccccca cctgctggac 60 

cccagggtcc caccctggcc caggaggtca gccagggaat cattaacaag aggcagtgac 120 

atggcgcaga aggagggtgg ccggactgtg ccatgctgct ccagacccaa ggtggcagct 180 

ctcactgcgg ggaccctgct acttctgaca gccatcgggg cggcatcctg ggccattggt 240 

gagagcggtt cctgtgcctc tgacctcccc tggcccctgc agcctccagc ctgcctgccc 300

tcacccctcc cttctctgca gtggctgttc tcctcaggag tgaccaggag ccgctgtacc 360

cagtgcaaga cgtggcggct gctgtgctcc tcgcgctcca acgccagggt agccggactc 420 

agctgcgagg agatgggctt cctcagggca ctgacccact ccgagctgga cgtgcgaacg 480 

gcgggcgcca atggcacgtc gggcttcttc tgtgtggacg aggggaggct gccccacacc 540 

cagaggctgc tggaggtcat ctccgtgtgt gattgcccca gaggccgttt cttggccgcc 600 

atctgccaag actgtggccg caggaagctg cccgtggacc gcatcgtggg aggccgggac 660 

accagcttgg gccggtggcc gtggcaagtc agccttcgct atgatggagc acacctctgt 720 

gggggatccc tgctctccgg ggactgggtg ctgacagccg cccactgctt cccggagcgg 780 

aaccgggtcc tgtcccgatg gcgagtgttt gccggtgccg tggcccaggc ctctccccac 840 

ggtctgcagc tgggggtgca ggctgtggtc taccacgggg gctatcttcc ctttcgggac 900 

cccaacagcg aggagaacag caacgatatt gccctggtcc acctctccag tcccctgccc 960 

ctcacagaat acatccagcc tgtgtgcctc ccagctgccg gccaggccct ggtggatggc 1020 

aagatctgta ccgtgacggg ctggggcaac acgcagtact atggccaaca ggccggggta 1080 

ctccaggagg ctcgagtccc cataatcagc aatgatgtct gcaatggcgc tgacttctat 1140 

ggaaaccaga tcaagcccaa gatgttctgt gctggctacc ccgagggtgg cattgatgcc 1200 

tgccagggcg acagcggtgg tccctttgtg tgtgaggaca gcatctctcg gacgccacgt 1260 

tggcggctgt gtggcattgt gagttggggc actggctgtg ccctggccca gaagccaggc 1320 

gtctacacca aagtcagtga cttccgggag tggatcttcc aggccataaa gactcactcc 1380 

gaagccagcg gcatggtgac ccagctctga ccggtggctt ctcgctgcgc agcctccagg 1440 

gcccgaggtg atcccggtgg tgggatccac gctgggccga ggatgggacg tttttcttct 1500 

tgggcccggt ccacaggtcc aaggacaccc tccctccagg gtcctctctt ccacagtggc 1560

gggcccactc agccccgaga ccacccaacc tcaccctcct gacccccatg taaatattgt 1620 

tctgctgtct gggactcctg tctaggtgcc cctgatgatg ggatgctctt taaataataa 1680

agatggtttt gattaaaaaa aaaaaa 1706 



<210> 115 
<211> 2742 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7510461CB1 



<400> 115 

ggggcgtggg ccggcaggaa gatggcgaac gtggggctgc agttccaggc gagcgcgggg 60 
gactcggacc cacagagccg gcccctgctg ctgctcgggc agctgcacca cctgcaccgc 120 
gtgccctgga gccacgtccg cgggaagctg cagccccggg tcaccgagga gctctggcag 180 
gctgccctga gcacgctcaa ccccaacccc acggacagct gtcccctcta cctgaactac 240 
gccaccgtgg ctgccctgcc ctgcagggtg agccggcaca acagcccctc ggccgcccac 300
ttcatcacgc ggctggtgcg gacctgcctg ccgcccggag cgcatcgctg cattgtgatg 360
gtctgcgagc agccggaggt ctttgcttcc gcctgtgccc tggcccgggc cttcccgctg 420 
ttcacccacc gctcaggtgc ctctcggcgc ttggagaaga agacggtcac cgtggagttt 480 
ttcctggtgg gacaagacaa cgggccggtg gaggtgtcca cattgcagtg cttagcgaat 540 
gccacagacg gcgtgcggct agcagcccgc atcgtggaca caccctgcaa tgagatgaac 600 
accgacacct tcctcgagga gattaacaaa gttggaaagg agctggggat catcccaacc 660 
atcatccggg atgaggaact gaagacgaga ggatttggag gaatctatgg ggttggcaaa 720 
gccgccctgc atcccccagc cctggccgtc ctcagccaca ccccagatgg agccacgcag 780 
accatcgcct gggtgggcaa aggcatcgtc tatgacactg gaggcctcag catcaaaggg 840 
aagactacca tgccggggat gaagcgagac tgcgggggtg ctgcggccgt cctgggggcc 900 
ttcagagccg caatcaagca gggtttcaaa gacaacctcc acgctgtgtt ctgcttggct 960 
gagaactcgg tggggcccaa tgcgacaagg ccagatgaca tccacctgct gtactcaggg 1020 
aagacggtgg aaatcaacaa cacggatgcc gagggcaggc tggtgctggc agatggcgtg 1080 
tcctatgctt gcaaggacct gggggccgac atcatcctgg acatggccac cctgaccggg 1140 
gctcaggtga gtgctccctg gatccacccc ttagctgtag tcccagggaa ccccacccca 1200
ctcttgacct caaggtgggg aaggcagcgc gtgggggctg ccgtcaggaa gttttcataa 1260 
ctgggcacga ggaggcagga ggagctgccc gtcaagcttg gtgtgggcag taccgaggcc 1320 
caggctggat gcagagccgg gaaccgcctc ctgtgtgctt catggccatg gtcctttctg 1380
cctgtgtttt ttcttttttt ctcaaccgtc tcttttctgg ctcccttatt tctctgtctg 1440 
cctcccggtc cctcttttgc cttgggtgtt tctctcctgc cgtcccgtcc acacgcttcc 1500 
cgggttcctg cccgcccagg gcattgccac agggaagtac cacgccgcgg tgctcaccaa 1560 
cagcgctgag tgggaggccg cctgtgtgaa ggcgggcagg aagtgtgggg acctggtgca 1620 
cccgctggtc tactgccccg agctgcactt cagcgagttc acctcagctg tggcggacat 1680 
gaagaactca gtggcggacc gagacaacag ccccagctcc tgtgctggcc tcttcatcgc 1740 
ctcacacatc ggcttcgact ggcccggagt ctgggtccac ctggacattg ctgcaccggt 1800 
gcatgctggt gagcgagcca caggcttcgg tgtggccctc ctgctggcgc tcttcggccg 1860 
tgcctctgag gaccctctgc tgaacctggt gtccccactg ggctgtgagg tggatgtcga 1920 
ergagggggac ctggggaggg actccaagag acgcaggctt gtgtgagcct ccfegcctcgg 1980 
ccctgacaaa cggggatctt ttacctcact ttgcactgat taattttaag caattgaaag 2040 
attgcccttc atatgggttt tggtttgtct ttctggtcgt cagcgtggtg gtggaaacag 2100 
ctgaagtttt aggagacagc ttagggtttg gtgcgggcca cggggagggg accgggaagc 2160 
gctggggctt gtttctgttt gttacttaca ggactgagac atcttctgta aactgctacc 2220 
cctggggcct tctgcacccc ggggtgaggc ctcctgcctg cctggtgccc tgtcccagcc 2280
ccaggtcctg tgcagggcac ctgcgtggct gacagccagg ctcttactcc agccggggct 2340 
gccagcgcat ccagccagcc cagccctgtg aaagatggag ctgacttgct gcaggggacc 2400 
tgatttatag ggcaagagaa gtcacactcc ggcctctcag aattcacttg aggfctcaatt 2460 
aaatacagtc acaccgcccc ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2520 
aaaagaaaac aaaacacagc agcacccaca acacgaaaaa gaaaaaaaga aaaagaccag 2580 
cacaccagca gtaaacggga gaaccaaaaa aacggggggc gggacgcaaa gagacgcacg 2640 
cgcacagcag caagaaaaac gaacaacacg gcgggcgaac aacacaacaa cgagcgaaaa 2700 
gcaccacaca acgagtgggg aacagaccaa gcacgaatgg ga 2742 

<210> 116 

<211> 1362 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7510392CB1 

<400> 116 

cggaactttc gtctcgtggg tgcgaaaggt aaccgaagcg gctcaggaag gcagctgtca 60 

ctgagcccct ggaacagagc gagagtatcg taagtaacca ggctcagccg gtttctcagg 120 

ccgctctagt caaataaacc ataaagatca gactcgggct tcttcacttc cttctctccg 180 

tggtttcgcc attagcttcc ggttccgggg aggggccgag ttttcttcga agatttgggg 240 

ctccgcgata cagttaggat ggctgtagta cctctgctgt tgttgggggg tttgtggagc 300 

gctgtgggag cgtccagcct gggtgtcgtt acttgcggct ccgtggtgaa gctactcaat 360 

acgcgccaca acgtccgact gcactcacac gacgtgcgct atgggtcagg tagtgggcag 420 

cagtcagtga caggtgtaac ctctgtggat gacagcaaca gttactggag gatacggggg 480 

aagagtgcca cagtgtgtga gaggggaacc cccatcaagt gtggccagcc catccggctg 540 

acacatgtca acactggccg aaacctccat agtcaccact tcacttcacc tctttctgga 600 

aaccagttgc tctgtaaggt gatattgtga tcattttata gaggagaaaa caaaggctca 660 

aaggatttac agaagaaggg atcaagttga gattcaagga agtgagtgct tttggtgagg 720

aaggtgaagg tgattatctg gatgactgga cagtgctctg taatggaccc tactgggtga 780 

gagatggtga ggtgcggttc aaacactctt ccactgaggt actgctgtct gtcacaggag 840 

aacaatatgg tcgacctatc agtgggcaaa aagaggtgca tggcatggcc cagccaagtc 900 

agaacaacta ctggaaagcc atggaaggca tcttcatgaa gcccagtgag ttgttgaagg 960 

cagaagccca ccatgcagag ctgtgaatct agaggctctg agccactgtt aacgcacaat 1020 

gttcacagac atctgttgct gcctcacctt gggatccctg ccacaagttc cttgggcagt 1080 

ggccatgtca ccattgagat gaagatatac aacagaaaat agtggctgtg tttggaagct 1140 

tcagccctgc acatttgaac tagtcactct cccagacttg cgtgggtcag ttctttctga 1200 

gtagaggact tgctggtaaa ggggcagatg ctttttatta gtactgataa aacaaactga 1260 

gggaaacatc cctcttagct gggaaacttt tactcttcag gagcttggca tcatggactg 1320 

ttaatgtatg tgafctttccc cctattttct ctctccaaaa aa 1362 